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ABSTRACT 


INTER- OBSERVER AGREEMENT AND MODELS 
OF MONAURAL AUDITORY PROCESSING 
IN DETECTION TASKS 
by 

Gordon Wells Wilcox 

Previous studies have shown that the output of an electronic energy 
detector is correlated with the responses of human observers in monaural 
auditory detection tasks. However, an experiment performed by Ahumada 
showed that when the signal (pure tone) is present in the observation 
interval the correlation is greatest if the filter in the energy detec- 
tor has a considerably narrower bandwidth than is necessary for maximum 
correlation when only noise is presented. Since this result is incon- 
sistent with a simple energy detection model of auditory processing, 
Ahumada proposed a filter- bank model to account for his findings. 

The present study presents a quantitative development of the linear- 
uncertain model which is a generalization of the energy model. The form 
of the linear-uncertain model is derived from the assumption that the 
observer is uncertain regarding the exact specification of the signal 
he is trying to detect. The decision variable of this model is the 
weighted sum of a linear function and a quadratic function of the input 
waveform. The relative weight of each component is determined by the 
observer' s level of uncertainty regarding the signal. It is -shown that 
the linear-uncertain model includes as special cases the linear, energy, 
and envelope models for auditory processing and like the filter-bank 
model can give an explanation of Ahumada' s findings. 



Predictions of the level of correlation between observers derived 


from the linear -uncertain model and the filter-bank model are compared 
for several experimental conditions. In the experiment the decisions 
of human observers are compared with the outputs of two electronic devices 
The first device is an analog multiplier which computes the cross- 
correlation between the signal and the noise waveform sample on each trial 
The second device is an energy detector which computes the energy of 
the noise waveform sample during the presentation interval in a narrow 
frequency band centered at the signal frequency. 

The linear -uncertain model predicts that the correlation between the 
human observers and the cross-correlation is not zero and should increase 
when a continuous sinusoid is added to the background noise. Neither of 
these predictions is verified. Since the energy detector receives only 
noise at input in this experiment, the model also predicts that the 
correlation between the observers and the energy detector should be less 
on trials when signal is present than when it is not present. The results 
show the correlations to be weakly significant in the opposite direction. 
It is concluded that the linear-uncertain model and its special cases, 
the linear, energy, and envelope models, represent an inadequate approxi- 
mation to the actual form of human monaural auditory processing In detec- 
tion tasks. 

Predictions from the filter-bank model agree with the above results, 
but cannot account for an observed decrease in inter-observer correla- 
tions when the signal presentation interval is shorted. A modification 
of the filter -bank model is suggested to account for this discrepancy. 



A final result remains unexplained by any of the models considered. 

A decrease in inter-observer correlations is found when a continuous 
sinusoid is added to the background noise for either of two signal dura- 
tions. It is emphasized that this unexpected finding implies that there 
is a serious deficiency in current models of monaural auditory processing 


in detection tasks. 
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CHAPTER X 


DIRECT COMPARISONS BETWEEN OBSERVERS 
1 » 1 Introduction 

Many auditory detection experiments have investigated changes in 
observer performance as the result of variation of the physical parameters 
of the input waveform or characteristics of the observer's task. Measures 
of performance which have been used include indices of the quality of the 
observer's decisions and studies of the shape of psychometric functions 
and receiver operating characteristic curves. The physical parameters 
that have been varied include frequency, phase, amplitude, and duration 
of a signal, spectral characteristics of background noise, signal-to-hoise 
ratio, and addition of pulsed carriers and continuous waves to the noise. 
Task characteristics which have been studied include single, double, etc., 
and random presentation intervals , pulsed versus continuous background 
noise, binary and multiple response categories, payoff structures, and 
probability of signal occurence (cf. Swets, 1964; Green and Swets, 1966). 

The performance measures in these studies I shall call indirect 
comparisons between observers. Probability of a correct response and 
the area under the ROC curve compare the observer's performance with the 
specification of experimenter defined hypotheses, d' compares observer 
performance with an optimum observer for the same task. Studies of the 
shape of the psychometric function, ROC curve, or "iso-bias" curve all 
compare observer performance with a model-specified in a similar task (a 
Glossary of Symbols and Terms appears in Appendix III). These comparisons 
are indirect because they do not compare observers on a trial-by-trial basis. 

Direct comparisons between observers do compare inter- or intra-observer 
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~ 2 - 

performance on individual trials. Green (1964b) has termed attempts to 
predict single trial performance "molecular psychophysics" as contrasted 
with molar psychophysics. In this sense, direct comparisons can be either 
molar or molecular depending upon the use to be made of the performance 
measures . 

The purpose of using different kinds of comparisons is the same: 

. . . the ultimate aim of detection theory is to discover the exact 
form of the distributions [of sensory events] using (1) the physical 
parameters of the signal and noise and (2) the character of the 
sensory detector .. .The major obstacle of this endeavor is... our 
ignorance about the nature of the sensory detector (Green and 
Swets, 1966, pp. 53-54). 

1.2 Comparison of Observer Performance with an Electronic detector 

The first direct comparisons between human' observers in an auditory 
detection task and that of an electronic device were made by Sherwin, 

Kodman, Kovaly, Prothe, & Melrose (1956). They recorded on magnetic 
tape 50 samples of a 1 kHz burst of several durations (0.03, 0.10, 0.30, 
and 1.0 seconds) in four different listening conditions. The tone bursts 
occurred at randomly spaced intervals in a background of noise (0-4 kHz) . The 
tapes were played back through earphones to observers and simultaneously 
through a 60 Hz wide passive filter, centered at 1 kHz. The output of 
the filter served as the input to a square-law detector and exponentially 
decaying integrator. The time constant of the integrator was set at one- 
half the tone duration for approximately optimum detection (sic) . The final 
output of the electronic detector was recorded on a pen recorder which 
marked the times that signal was present. 

The amplitude of the signal was adjusted so that the observers had 
a hit rate ("correct detections") of approximately 60% at each condition 
of signal duration. A criterion for the output of the electronic device 
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could be set by the experimenters so that it also had a 60% hit rate, thus 
matching the perf ormance of the device with the human observers during 
the presence of a signal. They found that when this match was made, the 
false— alarm rates ("incorrect detections") for all observers were lower 
at each signal duration than for the electronic device, with the smallest 
difference at a duration of 0.30 seconds. It was also for this duration 
that the sample distributions of the output of t,he device, conditional 
upon observer hits and misses, respectively, were most highly separated. 

That is, at 0.30 second duration the hits and misses of the electronic 
device had the highest correlation with the hits and misses of the observers . 
Unfortunately, a similar statement could not be made for intervals during 
which the signal was not present. In fact there appeared to be no associa- 
tion between the observers' responses and the output of the device during 
noise intervals, which is partly in evidence from the fact that the 
observers maintained lower false alarm rates than the device. The authors 
suggest that using a filter about 30% narrower that the one they employed 

would have led to approximately the same performance for the device and 
the observers. This in itself, of course, would not demonstrate a closer 
association between an observer's decision variable and the average power 
statistic computed by the device. It is possible that improving the 
performance of the device would not improve the correlation during noise 
intervals . 

1.3 Association of Observer Performance with an Energy Detector 

The next experiment involving direct Comparisons was reported in 
Watson's Ph.D. thesis (1962). The experimental method made several 
improvements over the procedure used by Sherwin, et at. Discrete 
observation intervals 0.25 seconds long containing pulsed noise waveform 
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samples were used. A signal, again a 1 kHz tone, was added to the noise 
on approximately one-half of the observation intervals. The observers 
could make one of four responses on each trial indicating their confi- 
dence that signal had or had not been presented. There was no immediate 
feedback as to whether the response was correct or not. The input to the 
observers was simultaneously recorded on magnetic tape for further analysis. 
This analysis was made by passing the pulsed waveforms through a filter 
approximately 100 Hz wide centered at 1 kHz and then measuring the number 
of times voltage peaks from each burst exceeded each of eleven different 
levels. A "voltage contour" was defined as the percentage of times 
voltage peaks exceeded a given voltage level plotted versus the voltage level. 
The area under the voltage contour was interpreted as an approximate 
index of the energy in a 100-cycle band centered on the signal' frequency . 
Voltage contours could then be plotted conditional upon the observers’ 
responses and upon whether or not signal was presented, averaging across 
all bursts in one of these eight conditions. The results showed that the 
area under a voltage contour was monotone increasing with the rating 
response on both trials containing signal and those which did not contain 
signal. No attempt was made to predict trial by trial responses of the 
observers based on the energy statistic. However, two observations about 
the data were made. First, consistent with the assumptions of the theory 
of signal detectability an observer's decision variable could be ordered 
on the basis of a physical parameter of the stimulus. Second, the results 
showed that there was association between the responses of an observer 
and the physical parameter on both signal and noise-alone trials , contrary 
to the findings of Sherwin, et al. 

Watson had stated in his introduction " — the energy within the 
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critical band has been shown in theory and by experiment to be a primary 
parameter of the auditory stimulus [in masking experiments]". Nevertheless, 
it is. apparent from Watson’s data that the observers’ responses were not 
determined by the area under the voltage contour alone, since every 
response occured at every voltage level. It could be postulated, as did 
Sherwin, et al. 3 that the relevant physical parameter had been, determined, 
but that the observers also had fluctuating criteria for making their 
responses. Equally well, in terms of these two experiments, it might 
be hypothesized that association between the measured physical statistic 
and the responses was due to the fact that both the statistic and the 
responses were associated with a physical parameter not measured in' the 
experiment. Such a parameter might account for the trial by trial 
responses of the individual without assuming a fluctuating criterion. 

1.4 Intra-Observer Consistency 

Green (1964b) fully realized this latter possibility and set out 
to determine the level of inconsistency of an observer’s response regard- 
less of the relevant -physical parameters of the stimulus. Pairs of 
noise samples without signal in either interval were recorded and inter- 
spersed with pairs containing signal in one of the intervals. In this 
2-interval forced-choice task, percentage self-agreement scores were 
determined on the no-signal pairs by having each of these pairs presented 
twice in the experimental session. 

In Green's first experiment the "signal" was an increment of power 
in the wide-band background noise. Three signal-to-noise ratios were 
used. The results indicated that percentage agreement of responses to 
identical pairs of no-signal samples was approximately 65% for each 
observer regardless of the percentage of correct detection responses 
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which averaged 60%, 74%, and 91% for the three S/N ratios, respectively. In 
a second experiment the signal was a 0.10 second gated sinusoid at a 
frequency of 250, -500, 1000, or 2000 Hz in four separate conditions. The 
average percentage self -agreement scores across observers and tapes was 
approximately 70% at all frequencies except for the 1000 Hz condition, for 
which the average was 78%. Several other experimental conditions were 
studied with the same general result: self -agreement scores differed little 

from 70% although there was a small but consistent trend for tapes using 
sinusoidal signals to generate somewhat higher scores than tapes for which 
the signal was an increment in the background noise. Green conjectured, 
but could not measure in his experiments, the possibility that the incon- 
sistency could be attributed to several causes. The first would be 
response bias — either pure, i.e., preference for the first or second 
interval, or sequential, i.e., a bias depending upon the previous response 
or previous feedback. Another possible cause is "internal noise" which 
has been variously defined as a fluctuation in the observer's criterion 
or noise .in his sensory apparatus. Green made several calculations which 
indicated that response bias effects should be small, and therefore 
proceeded to estimate the level of internal noise which would lead to the 
observed percentage agreement scores. His calculations suggested that, 
as a first approximation and minimum estimate, the ratio of external to 
internal noise is about 1.0. No thesis was advanced to explain the 
difference in agreement scores as a function of the kind of signal pre- 
sented. Green concluded that this 70 % consistency, since it appears 
independent of the sensory task, represents an upper bound on any attempt 
to predict trial-by-trial human detection responses . 

Apparently following a suggestion made by Green in his article, 
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Pfafflin and Mathews (1966) studied the consistency of observer responses 
to computer generated no-sign'al noise pairs, some of which had an iden- 
tical noise sample as each member of the pair. In addition, they added 
a 312.5 Hz tone to one or the other member of some of the pairs. The use 
of computer-generated and-controlled stimuli provided relatively accurate 
reproduction of waveform samples and permitted the presentation, of stimulus 
pairs in a large number of different orders , with the hope of greatly 
reducing any possible effect of response biases. A spectral analysis was 
made on each of the 12 samples of noise used in the experiments. The 
spectral analysis determined the relative energy in frequency bands- approxi- 
mately 100 Hz wide as a function of the center frequency. It was thus 
possible to determine the relative energy difference in a frequency band 
100 Hz wide centered at the signal frequency of 312.5 Hz between the 
members of each pair of waveforms. 

The probability of correct detection tended to increase, with con- 
siderable scatter, for three observers as a function of the relative 
energy difference for pairs containing a signal, whether or not the indi- 
vidual noise samples in the pair were identical. However, some inversions 
(a response preference for the member of the pair with the lower rela- 
tive energy level) occurred, most often for pairs with low relative energy 
difference. The authors could not find inter-observer agreement on inver- 
sions, nor could they find relevant physical parameters that might account 
for inversions of a single observer. Another experiment was performed 
using the same noise samples , where this time the observers were asked 
to judge if either interval contained a signal.' Since the preference 
probability tended to increase with the probability that a pair was 
judged to contain a signal for signalless different-noise pairs, it 



was concluded that at least for these different-noise pairs, the obser- 
ver must have been trying to detect a signal in the previous experi- 
ment. Corresponding results were not conclusive for signalless identical- 
noise pairs. This could have been due to the procedure under which 
the observers knew that some pairs contained no signal. The authors 
concluded that the energy increment produced by the signal, or some 
quantity closely related to it, is the chief physical parameter relevant 
to detection behavior. They thus agreed with Watson on the nature of the 
sensory processing. 

Pfafflin and Mathews' major contribution to Green's original effort 
on intra-observer consistency would seem to be the demonstration that the 
percent agreement for signalless pairs is dependent on the particular noise 
pair and perhaps, although this is not demonstrated conclusively, dependent 
upon some physical parameter of the noise samples. Estimating by eye, 
it would appear from Pfafflin and Mathews’ Figure 4 that the average per- 
cent-agreement for signalless different-noise pairs is about 60-70% for 
each observer, in agreement with Green's results. 

1.5 Correlation of Observer Decisions with a Variable -Bandwidth Energy Detector 

The most recent study involving direct comparisons is reported in 
Ahumada's Ph.D. thesis (1967). In a context of attempting to measure criti- 
cal bandwidths directly, Ahumada had observers respond ("Yes" or "No") to 
single 100 msec noise bursts which sometimes contained a superimposed tone, 
as in Watson's experiment. An electronic detector with specifiable bandwidth 
was simulated on a digital computer as in the Fhafflin and Mathews study. 

The observers' average responses from 5 replications to the same stimulus 
were correlated (Spearman's rank correlation coefficient corrected for ties) 
with the outputs of the simulated energy detectors. The correlation was 
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computed separately for signal-plus-noise and noise-alone trials to determine 
the bandwidth for which there was maximum correlation. The bandwidth 
associated with the maximum correlation should provide a fairly direct 
estimate of the critical bandwidth. A surprising result was found. A 
filter with a 10 or 20 Hz bandwidth had maximum correlation with the 
observers’ responses for signal-plus-noise trials. But on noise trials 
a wider filter with 100 to 200 Hz bandwidth had maximum correlation. This 
result , of course, can not be predicted from the simple energy-detection 
model. Such a model predicts that the same width filter should correlate 
best with responses to signal-plus-noise and noise-alone stimuli. Ahumada’s 
finding deals a serious blow to the conclusion that the energy in a 
narrow band is the primary physical parameter. The most that can be concluded 
is that the energy in a given band is associated with the physical determinants 
of detection behavior. 

Ahumada suggested a "filter-bank" model to account for his data. 

According to this model the observer can monitor the output of a number 
of narrow-band filters with a total bandwidth of about 150 to 200 Hz. 

The observer makes the detection response when any of the individual out- 
puts exceeds some critical value. On signal trials the output of the 
narrow filter centered on the signal frequency almost always has maxi- 
mum output, whereas on noise-alone trials any member of the bank has equal 
likelihood of exceeding the criterion. 

Ahumada’s filter bank model is not contradicted by the data of 
Sherwin, et al ., Watson, Green, or Pfaff lin and Mathews. In the case of 
Sherwin, et al., who used a 60 Hz wide filter, the lack of association 
of its output with false-alarms intervals could be explained by the fact 
that the filter was too narrow. Incidentally, this would predict that 
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narrowing the filter as jthe authors suggest should decrease the association 
with observer responses in false-alarm intervals. In Watson’s case the 
filter was 100 Hz wide, or in the middle range, so that one would expect 
about equally poor (or good) association with responses on signal and noise 
trials, as is consistent with an examination of his data. Also, the same 
conclusion appears verified from Ahumada’s data. With respect to Green’s 
data, a variance in the observer’s criterion would still decrease the per- 
centage self -agreement . In fact, if the observer is limited to monitoring 
the output of a 200 Hz filter -bank, then a fluctuation in the center 
frequency of the bank would account for the increased decrement in percent 
agreement for signals which are increments in the power of wide-band noise, 
as Green found. Such a fluctuation would not cause a further decrement 
when the signal is a sinusoid. Finally, since the maximum output from a 
single narrow-band filter should be only poorly correlated with the total 
energy in a 100 Hz band, Ahumada's model can give a post-hoc explanation 
for the fact that Pfaff lin and Mathews found inversions in preference 
for signalless different-noise pairs with low relative energy difference. 

It would appear that the filter-bank model with its associated 
decision rule is adequate in a qualitative fashion to account for the data 
from direct comparisons between human observers and electronic devices 
currently available. 

1.6 Objectives of the Present Work 

The filter-bank model has a certain appeal by its analogy to the 
physiology of auditory system. (This analogy is briefly discussed by 
Ahumada. However, the details do not concern us here.) On the other 
hand, the simple energy detection model has had an appeal in auditory 
psychophysics at least partially because it represents an optimum mode 



11 - 


of processing under certain conditions (c.f. Green, 1960; Pfafflin and 
Mathews, 1962; Swets, 1966, Ch. 8). 

In this thesis I shall develop (in Chapter III) the linear-uncertain 
model which dictates an optimum mode of processing for an observer with 
uncertainty regarding the specification of the signal. The linear-uncertain 
model turns out to be a slight generalization of the optimum observer with 
a noisy stored reference signal -derived by Birdsall (1960) . The energy 
model and several other processing models are found to be special cases 
of the linear-uncertain model. Moreover, the linear-uncertain model, 
unlike the energy model, is not completely frustrated by Ahumada’s. findings 
(Chapter IV) . An experimental attempt is made to discriminate between 
the filter-bank model and the linear-uncertain model based on the predictions 
that these models make for the degree of concordance between observers in 
several tasks (Chapter V) . 



CHAPTER IX 


REPRESENTATION THEORY 


2.1 Introduction 

Auditory detection and recognition tasks in the laboratory often 
involve the presentation of complex waveforms to human observers. It 
is assumed that the observer makes a judgmental response based upon 
certain operations which he performs on the input to his ears. Models - 
which attempt to describe the observer’s judgmental performance must 
also describe the operations performed on the input which could give 
rise to the observer's performance. There is a need, then, to obtain 
a description of the impinging stimulation or input process itself. 

A description of or representation theory for auditory waveforms 
can be developed at many different mathematical levels, depending on 
the rigor desired and the degree of error that can be tolerated. For- 
tunately, in psychoacoustical tasks a relatively simple representation 
theory is often adequate. In particular, acoustically presented wave- 
forms are usually of sufficient bandwidth and duration that a finite 
Fourier series approximation to a sample waveform is sufficiently accurate 
for many purposes 1 Green and Swets (1966) hold this view and have 
presented such a model for the representation of sample waveforms 
generated by a real-time stochastic process. 

This chapter presents a representation model for waveforms in the 
frequency domain which is identical to the model used’ by Green and Swets. 
However, the temporal representation model presented by Green and Swets 
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is inconsistent. This fact is discussed in some detail because we wish 
to make extensive use of the geometrical properties of representation 
models . 

Finally, this chapter establishes a notation which will be exploited 
throughout the remainder of this work. 

2. 2 Representation of Waveforms in the Frequency Domain 

Following Green and Swets (1966) we assume that a finite Fourier 
series approximation to a waveform x(t) with nod.c. component in the 
interval ‘ 0 <_ t <_ T is sufficiently accurate for our model construction. 
This approximation to x(t) is given by 

m 

x(t) = l [a-c.(t) + b.s^t)], (2.1) 

i=l 

where W is highest frequency component in the series. 



c iW ■ J 

f2 2-rrit 

T COS T 

(2.2a) 


s i w = j 

fT . 2nit 

J Sin -Tj; — , 

(2.2b) 

and 

T 

l-lff 

x(t)c i (t) dt 

(2.3a) 


0 

T 

x(t)s^(t) dt. 

(2.3b) 


0 


for i = 1, 2 , ..., WT. 

The functions c. (t) and s. ft) form an orthonormal set over the 
i l 


interval [0,T], i.e., 



14 


and 


T 

I 

0 


C i (t) 


dt = 


s?(t) dt = 1 

i 

<» 

0 


c i (t)s.,(t) 


dt i= 


c.(t)c (t) 


dt 


T 

j s.(t) Sj (t) dt - 0 
0 


(2.4a) 


(2.4b) 


for i ^ j and i,j = 1, 2, WT, as is well known and easily verified. 

A less often noticed fact is that the functions 2c^(t) and 2s^ (t) form an 
orthonormal set over the interval [0,T/2]. We shall use both of these 
orthonormal sets of functions below. 

The form of Equation 2.1 indicates that x(t) may be identified with 
the (column) vector 

— j > a 2 > • • • » ^WT*^1 » ^ > 2 > ’ * ’ J ^WT"® * (2*5) 

in a 2WT dimensional vector space (the prime " ' " denotes the transpose 

of a matrix) . The set {^(t), ..., c^ft) ^ (t) , ..., s^(t)} is an 

orthonormal set of basis vectors for the space. This 2WT dimensional 

vector spa’ce, denoted F, will be called the frequency representation space. 

We follow Green and Swets (1966) for our model of noise waveforms. 

Bandlimited Gaussian noise is a real time -parameter stochastic process 

(n(t,n)} where the vector n of random variables has a multivariate normal 

distribution ff(P nJ ? n ) with mean vector y^ and dispersion (variance- covariance 

matrix For a particular sample n = [a^, ..., b^] ' of n 

the corresponding noise waveform sample n(t) = n(t,n) is approximated 

in the interval 0 < t < T by the series 

WT • 

n(t) = l [a.c (t) + b s (t)] (2.6) 

i=l 

where W is the bandwidth of the noise process. 
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The noise is said to be white when 


t n = (N 0 /2)J (2.7) 

where I is an identity matrix, Nq = N/W is the noise power density 

2WT 

and N is- the average noise power. Also, note that det t n - (Nq/ 2) 
and that - = (2/N Q ) J. 

The multivariant normal density function of n for a bandlimited 


' white Gaussian noise process may be written explicitly as 

,-l 


f (n) = (2r)’ WT (det / 


= CvN q ) e 


. -1/2 [n'/ n n] 

) 11 e 

-WT 


( 2 . 8 ) 


2.3 Representation of Waveforms in the Temporal Domain 

Green and Swets (1966) have stated that there exist a set of 
interpolation functions {’J'jCO)* j = 1, . 2WT such that 

i) the set {/2W i/k (t) } is orthonormal over the interval [0,T] , 
2WT ^ . 

ii) x(t) = [ x(^-)i|. (t), and 

j=l 


WT 

iii) iK(t) = l [a(j) -c. (t) + b(j).s. (t)] , 
3 i=l 11 11 


where a(j)^ and b(j)^ are constants. 

The assertion that (i) , (ii) , and (iii) hold simultaneously is 
not strictly true. I shall show conditions under which the frequency 
and temporal representations give nearly equivalent results. 

According to (ii) we require functions ^.(t) such that 


WT 

l [a i c i (t) + b i s ± (t) ] 
i=l 


2WT , WT 


tf.IT A r IT A 

J { J tVi^ 

1=1 ( 1=1 




(t) 
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NT 2WT - 


l t a i l c i C^p* i Ct) +b l s-C^jp *,{«)] 

i=l 1 j = i 1 zw 3 j=l 1 * tv j 


According to (i) we may multiply each side of this expression by 

ip.(t) and integrate with respect to t over [0,T] to obtain 

T T 

NT r • f 


J t a i c i^lif^2W^ + b i s i^2W^2Hp^ 

i=l 


;I [a. j c.(tD^(t) dt + b. j s.Ctj-^Ct] dt] 

1 — 1 n n 


Equating coefficients of a^ and b^ we have 

T 

C-OO^Ct) dt = 2W C i ( 2N 5 

and ^ 

f _ .i. 1 t J -v 


s i ^ Ct) dt = 2W s i ^ * 


But according to (iii) these integrals are the coefficients a(j)^ and 
b(j)., respectively, in the finite Fourier series representation of 
ip. (t) . Therefore, we may write 

WT 

*j Ct) = ^ X [c i ( 2W )c i Ct) + s i ( if )s i Ct)] - 
J i=l 

(Green and Swets omitted reporting this explicit form for ^(t).) 

However, consider the integral 
T T 

f NT WT r 

- (i 5 2 l l (cp^c.Ct) * s.(i)s i( t)] 

o . • i “ 1 j =1 4 

• dt ' 


(2.9) 


" J f C iW C iW + S i ( 2l'P V'2W^ 
1=1 


( 2 . 10 ) 



According to (i) this must be equal to 1/2W when m = n and 0 when 
m / n. This will be approximately true under certain conditions stated 
below. 

The following lemma is a consequence of "sampling theorems" for 
bandlimited waveforms (Middleton, 1960, Ch. 4). 

Lenina 2.1. If x(t) and s(t) are two waveforms, Fourier transform 
band-limited to the same frequency interval [-W,W] and the integer 
2WT » 1, then their cross correlation 


r 1 2WT 

x(t)s(t)dt=i l xCJf)s <Uf>, (2.11) 

0 

where the error in the approximation to the integral for fixed W is 
of order 1/T. 

Over the half interval [0,T/2] Equation 2.11 may be expressed as 


x(t)sct) dt * ~ l x c/j) s (/,) (2.12) 

0 5' 1 

where the error i s of order 2/T. We may apply this latter expression 
to the following integral:' 


T/2 


m n 


0 


WT 

I 

i=l 


[ c m (t) c n (t) + s m (t)s n (t)]dt = ^ I [c m (^)c n C^) + s m (^)s n (^)] 


However, when m = n the left hand side is simply 1/2 + 1/2 = 1 and zero 
otherwise. We have proved 
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Theorem 2.1. When 2WT >> 1 


J ^m^W^n^W-* + s m^2«P s n^2K^ 
1=1 


2W if m = n 


0 if m f n 


where the error -in the approximation to the sum is of order 2/T. 

The theorem is not new (see Goldman, 1953, Appendix VI). 
Corollary 2.1.1 . The functions (t) given by (2.9) form an 

approximately othonormal set over the interval 0 < t < T when 


2WT » 1. 


We have shown that the three conditions which Green and Swets 

impose on the functions i[k (t) are not strictly satisfied simultaneously. 

We can find i|>.(t) which do in fact form an orthonormal set over the 
3 

interval. In this case it will be impossible to express the functions 
exactly by a finite Fourier series as required by condition (iii) . Alter- 
natively, we can require condition (iii) in which case orthogonality of 
the interpolation functions will not be strictly satisfied. In neither 
case will condition (ii) be strictly satisfied. Of course, the differences 
between the frequency and temporal representations of x(t) become small 
when 2WT is sufficiently large. It is instructive to pursue this approxi- 
mate equivalence somewhat further, although doing so represents a digression 
from our main purpose. 


2.4 On the Relation Between the Temporal and Frequency Representations 
when 2WT » 1. 

We may apply Lemma 2.1 to Equations (2.3) to obtain 
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a i “ a i -f 

r 2KT 

- — y x f-i-i c A 

r 2W W i^ 2 W J 

1=1 

(2.14a) 

and 



' b. - fi. -fi 

i i V 

r . 2WT 

Y x f_i_i s r_l_) 
r 2w A ''2W J5 i k 2W J 

(2.14b) 

1=1 

* 

Thus as an approximation to x(t) we may write 


A 

WT 


x(t) = x(t) = 

T [a.c. (t) + b.s. (t)] 
Ail ii 



i n 


WT 2WT 


2W ST J j x (^)[ c i C 2 V C i ( ' t) 
- 1=1 1=1 


+ s i^P s i^ 

rr 2m 

’ft .1 

1 = 1 J 


where ^ (t) is defined by (2.9). 


The vector of weighted samples of x(t) we denote by 


(2.15) 



x - 

_I_r xf A x( 2m )]' 

LX1 2w J, .... x l 2W JJ • 


(2.16) 

We also let 





and define the 

*f "V2 ^1* *•** ®WT ,6 1’ B wP 

2WT x 2WT matrix 


(2.17) 


C 1 V 

c 2 ( 2w ) ••• c WT^2Kp S 1^2W^ 

s (— ) 
WT l 2W j 


< 7 .- 1 - . 

✓2W 

C 1^2W’ ) 

2 2 2 
•** *•* 

« 

* 

f 2 1 

s wt4w j 

* 

« • 
• 

(2.18) 

w 

,2WT, 
C 1 ( ‘2W - 1 

f 2WT. ,2WT. 

. . . . Js l l 2W } " ' 

f 2WT. 
S WT l 2W J 
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Then Equations 2.14 may be expressed in matrix notation as 

x f = C'x. (2.14') 

Theorem 2.2. The matrix C is approximately orthogonal for 
2WT » 1. 

Proof. A necessary and sufficient condition that C be orthogonal 

is that CC' = I where I is a 2WT * 2WT identity matrix. By picking 
til til 

the m — row of C' and the n — column of C and adding the product of corres- 
ponding terms, one of the following expressions is obtained: 



But according to Lemma 2.1 each of these expressions is approximately 
equivalent to a corresponding integral. Examination of Equation 2.4 identi- 
fies these integrals and completes the proof of the theorem. 

A 

Thus, when 2WT becomes large x(t) approaches x(t) and the frequency 
and temporal representation spaces F and T, respectively, become equivalent 
differing only by a choice of axes in the space. We shall, however, take 
as our primary mode of approximation the frequency representation space 
of Section 2.2. 

2.5 Representation of Certain Linear Filters 

The action of a linear filter A on a, waveform x(t) may be conveniently 
represented by a linear transformation on e F. 
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Let A ^ be a (possibly singular) linear transformation on F. Then 

2 

is a projection operator if (i) = A f and (ii) = i4£, that is, 

if A ^ is an idempotent and symmetric matrix. 

Projection operators on F are of particular interest because they 
may be used to represent idealized square-bandpass filters. A 2W q T 
dimension subspace F of F is generated by the projection operator A f 
if for any x ^ e F, e F . The rank of is 2W a T. 

The output y ^ of a filter A- is defined by 

y £■ = C 2 «i9) 

More complex (and more realistic) representations for filters 
may be constructed. The objective here, however, is not to find a repre- 
sentation for such physically realizable filters as might be used elec- 
tronically in real time. Rather, the purpose is to indicate idealized 
operations which might be performed on a waveform by a device with memory 
which can record waveforms for short periods of time. For such a device 
the operation indicated in Equation 2.19 could be performed. 

When the input to a fixed filter A represented by A^ in the frequency 
domain, is a noise vector n^, the output An ^ of the filter also has a 
multivariate normal distribution, although degenerate if the rank of A ^ 
is 2W q T < 2WT. Artificial difficulties in describing the distribution 
of the output of the filter may be overcome when the action of the filter 
is a projection operation. 

Suppose that (s^, s^, s^) is a set of k orthonoimal vectors 

in F . Let B be a 2WT x k matrix with columns s^, s •••> i.e. , 

B = [s ... s k 3- (2.20) 

It is easily verified that A ^ = BB' is a projection operator on F. 
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Furthermore, since 

Ar;S. = BB's. = s., i « 1, .... k, 
f 1 li 

{s-pS2> . •■, may be considered an orthonormal set of basis vectors 

for the k-dimensional subspace F* generated by the action of on vectors 
of F. In fact, for any x ^ e F., 

y = Afpj: = BB'x ^ - B • 

= (sja^^ + (s£c f )s 2 + ... + (s£s f )s k . 

With y* = B’x^, Equation 2.21 establishes an isomorphism between y and 
y* given B. 

Definition. A veotanguZar filter is a k-dimensional projection 
operator on F which has the form A ^ = BB 1 where B is a matrix whose k 
columns form an orthonormal set of vectors . 

Theorem 2.3. The output of a rectangular filter with bandwidth 

W , whose input is white Gaussian nosie with bandwidth W > W , is white 

a . — or 

Gaussian noise with bandwidth W . 

a 

Proof. Let have the density of Equation 2.8, and let = BB ' 
be a rectangular filtering operation where B is the matrix of Equation 2.20. 
Then the output of the filter is represented, up to isomorphism by the 
random vector 

n* = B’n f . (2.22) 

Since the rank of B is k, n* has a multivariate normal distribution with 
mean vector p(n*) = E[B'n^\ = B'E[n^\ - B 'v n , and dispersion matrix 


Tf 
S 2 C f 

• 

s'x 
n f 


( 2 . 21 ) 
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Z n * = E[B'hh'B] = 

«* fl'(N 0 /2)JB = (N 0 /2)B»B 

- (V 2 ^ J k 

where Tj, is a k x k identity matrix. The proof is completed by letting 

2W T a k. 
a 

Xt should be noted that passing from the space F to its subspace F* 

is not, in general, a reversible process. Given only the output vector 

«*, there exist infinitely many vectors n £ such that n* - B'rtr- when 

r i 

B has rank less than 2WT. 

The next chapter develops a general class of models for human 
monaural auditory processing in detection and recognition tasks based 
on the representation theory developed here. 



chapter III 


MODELS OF MONAURAL AUDITORY PROCESSING 
3. 1 Introduction 

Several models have been proposed in the signal detection literature 
to account for the perfoimance of human observers in monaural detection 
tasks. The models postulate a processing mechanism by which the observer 
derives information from the input acoustic waveform regarding which of 
several alternative experimenter-defined hypotheses is presented. Recently the 
"linear", "energy", and "envelope" models have been extensively reviewed 
in Green and Swets (1966). These particular models have in common the 
fact that they describe sensory operations which would give rise to an 
optimum decision variable for some task (not necessarily the one in which 
the observer finds himself) . 

The primary objective of the present chapter is to develop a new 
model, the linear-uncertain model, which includes the linear, envelop and 
energy models as special cases. The theory is presented in a form which 
requires the observer to use all the information available to him in a way 
which is optimum given a residual uncertainty regarding parameters of the 
signal, A discussion of empirical predictions from these models is presented 
in the next chapter. 

3, 2 Task Considerations 

It is assumed that the acoustic input to the observer is Gaussian 
noise, to which occasionally a constant waveform is added. The constant 
waveform is the signal- and its presence has a one-to-one correspondence 
with the experimenter-defined input hypotheses. Thus, the observer's 
task is one in which the signal is specified exactly. It is not, of course. 


24 
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a foregone conclusion that the observer knows precisely the experimenter’s 
exact signal specification. 

Denoting the input waveform sample by x(t) and the signal waveform 
by s(t), 0 < t < T , the alternatives presented to the observer are 

H. : x(t) = n(t) + s(t) ; 

(3.1) 


H q : x(t) = n(t) . 

The task also requires that the observer make a judgmental response r 
indicating which hypothesis alternative was actually presented. For the 
task to be effective, the observer's ability to discriminate between 

J 

and through r must depend only upon the information contained in the 
input waveform. This task requirement may be stated in terms of the 
conditional independence of the response from the presented hypothesis, 
given the presence of the input waveform. 

Assumption 3.1. For every response v in the observer's repertoire 
and input waveform x 


P(r|a;,H) = P(r|ar) (3.2) 

where H is either or 

In our presentation of models for sensory processing we will use the 
representation theory for waveforms developed in Chapter II . Thus , the 
sample space X may be interpreted as either F or T and x(t) may be repre- 
sented by x defined at Equation 2.5 or 2.16 when 2WT is large. We also 
restrict the space of all possible signal vectors to X. Thus, for example, 
a particular signal vector s identifies a one-dimensional subspace of X, 
namely, the set of vectors in X proportional to s. 
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In terms of the representation theory the experimenter-defined hypotheses 


of Equation 3.1 are 


x = n + s; 
Hq : x — n. 


(3.3) 


As in Chapter II we shall not assume that the mean of the 

noise u is zero and shall continue to adopt the more general position 
n 

that the multivariate normal density of the noise vector is 


(3.4) 


where as before the dispersion matrix JE^ = (N Q /2)J. 

The conditional densities of the input vector a; are, from (3.3) and 


(3.4), 


Hj! fjteO = f(x - s) = N ^ n + s,t n ) 

H q : f 0 (a:) = f(«) = ^(u n ,X n ). 


(3.5) 


It is assumed that every observer knows the distribution of the noise, 
including the mean of the noise. Some observers may have, however, uncertainty 
regarding the signal, since it is not always present in the background of noise. 

If the signal s is only known to an observer through an a priori distribution 
G(s) of possible signals, then the unconditional distributions of the input 


known to the observer are 


* h j Or) = 


f (x - s) dG (s) , and 


(3.6) 


H 0 : = f(x) 

respectively, where f is the normal density of (3.4). 

It is well known that the likelihood ratio £(x) , or some monotone function 


of it, z, is the optimal decision variable for discrimination, between hypotheses 
Hj and (Peterson, Birdsall, and Fox, 19S4) . From (3.6) the likelihood ratio 
£00 is. 
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4>0 = h 1 (a:)/h 0 (a;) 

f(x - s)dGfs)/f (a;) 


■I, 

•L 

= f £(x!s)dG(s) 
Jy 


(3.7) 


£ %KT ldGCs) 


where 


Zix | s) = f : (a;)/f 0 (a;) = £{x - s)/f(a;) 
is the conditional likelihood ratio given that s is known exactly. 


(3.8) 


3.3 Signal Uncertainty 

Models of the ideal observer have been constructed for various tasks 
by evaluating (3.7) for the unconditional likelihood ratio function of the 
input process. This is done by assuming that in the task the signal is not 
specified exactly, but rather has a distribution G(s). A resulting model 
of the ideal observer is then taken as a model of human observer performance 
in a task in which the signal is specified exactly. The rationale for this 
approach to model construction appears to be based on the argument that 
prior uncertainty regarding parameters, which are actually constant in the 
task and characterize the signal, should result in a response performance which 
is the same as would arise from an observer with precise knowledge of the 
actual uncertainty of signal parameters. However, a close look at Equation 
3.6 shows that one should not expect this assumed equivalence. The 1^ condi- 
tional density h^(x) is the density of the input process assumed known to the 
observer. The density h^x) does not necessarily represent the knowable distri- 
bution density of the input to a perfectly informed observer. For the latter 
observer, G(s) represents the actual or experimenter-defined signal uncertainty 


in the task. 
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The assumption which will be made here, perhaps gratuitously, is 
that the processing operation performed on the input by a human observer 
is determined by the likelihood ratio Equation 3.7, where the distribution 
G(s) represents the "internal" observer-specified uncertainty regarding 
the signal. In tasks where the signal is fixed the distribution of the 
processing function or decision variable determined by (3.7) is completely 
determined by the distributional characteristics of external and internal 
noise. 

Assumption 3.2. An observer's prior uncertainty regarding the signal 
s, in tasks in which s is specified exactly, may be represented by a multi- 
variate normal density function 

g(») = dG(s)/d = (3.9) 

with mean vector and dispersion matrix The random vector s is inde- 
pendent of the noise vector n. 

It will be shown that (3.9) in connection with (3.7) is quite general 
enough to specify a processing function which includes as special cases 
most previously proposed models of human monaural auditory processing. 

3.4 A General Structure for Processing Models 

Assumption 3.2 allows evaluation of the observer-specified distribu- 
tions of the input defined at (3.6). Under input hypothesis the con- 
volution integral may be found using (3.5) and (3.9), so that 

1*1 (*) = tf(u n + u a ,2 n + y 

(3.10) 


= Hv ,t ) , 

v m’ nr 
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where 


Under H Q , as before, 


y = y + y . 
m p n p a’ 

t - t * t . 

m n s 


h„(x) = i?(y ,% ) 

a v v n ,( ]r 


It is convenient to define the precision matrices 

- C ■ »n * V 1 


C3.ll) 


(3.12) 


(3.13) 


and 


Q » t~ 
n n 


(3.14) 

The likelihood ratio of (3.7) is then found directly by using (2.8): 


l{x) = h 2 (a;)/h 0 (a:) 

_WT V i-Hv-v )'G n (»-y J 

- C2tQ (det^j e m m m 

% -¥^-y n )’3 n (^-y n ) • 

(2ir) W1 (det$ n ) e 

Or, simplifying in terms of the logarithm of the likelihood ratio, 
z* = £w [£(#)] 


(3.15) 


4-(a? - y)'S(®-y) 
2^ p nr m m 


/ detO 

= I i n 5L 

2 n \ detQ n 

. |(X - W n ) '«„<* - p n ) 

, / det£ \ . 

= 2 ^dit^J + 2 ( * " V'^n - V ( * - V 

+ y ' Q (a: ~ y ) - iy ' Q y . 
a p n' 2 a nr a 


(3.16) 


In the following the constant term of (3.16) will be of no interest and 
may be ignored without loss of generality. It may also be seen that (a; - y ) 
is an invariant translation of the input, regardless of the specification of 
the parameters f? m and y^. Thus, we shall take the following form as the 
general structure for our processing models: 
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z = 2d'K 


Q )y + 
m r 


\i'Q (v - -~p ) 
am~ 2 H a 


(3.17) 


where 


V - x - v n , (3.18) 

and (3.11) and (3.16) have been used. 

The decision variable*z of any particular observer within the general 
class of observers covered by assumption 3.2 may be evaluated by specifying 

Sn and p a in C 3 * 17 ) 

3.5 The Model of the Ideal Observer. . 

As an application of the foregoing discussion we may obtain the decision 
variable of the ideal observer for the case of signal known exactly (Peter- 
son, Birdsall, and Fox, 1954). 

Since the signal is specified exactly in the listening task the ideal 
observer knows the specification exactly and it follows that the ideal 
observer's prior specification of the signal is the signal itself; i.e., 

= s. Since the specification is exact, % = 0. It then follows that 

u * S 

= 2 n + % s = 2 n and = § n = (2 /N q )T. The desired values of and 
having been obtained, they may be inserted into (3.17) to give 

= 0 + s'(~ I)(y - Is) 

= I- (s'y - |s‘s) (3.19) 

as the decision variable of the ideal observer. Of course, there are other 
functions, monotone with z^, which would serve as well . 

It may be seen from Chapter XI that s's = E g is the energy of 
the input signal (since s is represented exactly by a finite Fourier 
series) . Defining the dimensionless quantity 


d = 2 V N o- 

the decision variable may be written in the form 

z = *£l}L _ 1 
1 N 0 2 d 

which has a straightforward geometric interpretation in F. 


(3.20) 


(3.21) 


Formally, we may identify the judgmental responses r with ordered sub- 
sets of the range of z. 
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The projection of y onto the line determined by s in F is the vector . 
s. Thus is a linear function of the magnitude of y in the direction 
of s in F. 

In the usual analysis of the two alternative single-interval task (the "Yes- 
No” experiment ) an observer sets a cutoff value for z, say z^, such that when 
the modified input y produces a value of z greater than z^ 3 the observer makes an 
Rj response, i.e., "Yes -- signal was present". The set = {y\z{y') >_ B, y e F} 
is called the criterion region. In the case of the ideal observer it is easy - 
to determine a criterion region based on the decision variable z^ of Equation 
3.21. 

A fixed value 3 of z^ determines a hyperplane in F perpendicular to the 
line of s. This is illustrated in Figure 3.1. The plane of the figure is taken 
to be the plane in F- determined by y and s. The dashed line represents the 
intersection of the hyperplane perpendicular to s and the plane of the paper.. 

As was seen in Chapter II certain linear operations in F may be viewed 
as filtering the input. Such is the case here for the action of the ideal 
observer in producing the decision variable .z . The subspace of F onto which 
the modified input y is projected is simply the line of s. The filtering is 
perfectly "matched" in frequency and phase to the signal. By definition,' 
the decision space of an observer is the range of his decision variable. In 
this case the range is isomorphic to the line of s. (The isomorphism results 
from the fact that the metric of F carries over to the decision space of the 
ideal observer.) 

3.6 A Model for Human Monaural Processing: The Linear Uncertain Model 

The general structure developed in Section 3.4 for a linear observer 
with uncertainty could be taken directly as a model of human monaural . 
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Fig. 3.1. Representation space of the ideal observer for signal specified 
exactly. The paper represents the plane in F determined by the 
modified input y and the signal vector s. A criterion region i? 
in F is determined by the cutoff value 8 on the decision axis 
which is isomorphic with the line of s. In the case illustrated, 
y lies in the region .R which is equivalent to the fact that its 
projection onto the line of s above the cutoff 8. 
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in the task with signal specified exactly. However, without additional 

simplifying assumptions the model is too general in the sense that as 

parameters we would have all the entries of p and % . Most of this freedom 

o s 

is quite unnecessary to obtain strong predictions from the model. I will 
assume that the human observer knows a region of F occupied by the true signal 
a, but is uncertain about the exact magnitude of the components of s within 
that region. This assumption is made more explicit in the following. 

Assumption 3.3. The dispersion matrix X s of the prior distribution for 
an observer a is proportional to a projection operator D ^ on F. Furthermore, 
the subspace generated by the action of D on vectors of F contains 
F n » V B and s - 

The assumption implies that 


M 0 


(3.22) 


where Mq/ 2 is the constant of proportionality, and that has rank 2W^T 
which is less than or equal to 2WT, the dimensionality of F. Since D is 


a 


indempotent, it preserves the magnitude of every vector in F .• In particular ,- 


P p ~ y . 
an n* 


(3.23a) 


P p = y , 
a a a 


(3.23b) 


and 


D s = s. 
a 


(3.23c) 


D represents a rectangular linear filter in F(see Chapter II), and corres- 
ponds to idealized square band-pass filtering of the modified input. 

As we shall see, it is convenient to think of the constant in (3.22) as 


representing the observer’s uncertainty (or imprecision, or variance) 
regarding the signal amplitude, per unit bandwidth of the signal. It is only 
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appropriate to scale such uncertainty relative to physical measurements; 
in this case the uncertainty in the input is the variance N Q of the noise 
process per unit cycle. The ratio of internal uncertainty to the total 
uncertainty, therefore, is defined to be 


A = 


• M o 

M o + N o 


(3.24) 


It follows that A is a nonsingular transformation of the scaled value of 

M 0 , i.e., 

M„/N n = 


O' 0 1 - A * 


Now can be found directly: 


Q - (.X + t ) 

v n s' 


-1 


N 0 M o -1 

s ( 

_ 2 f M 0 

” N ^ + N 

0 0 


= t (X - AD) . 
0 


(3.25) 


The last step is easily verified since D is indempoterit. As a consequence. 




2A 


D. 


(3.26) 


With the explicit representation for the precision matrices in (3.25) and 
(3.26), the decision variable z^ for observer a is found by substitution into 
Equation 3.17: 


or, since 


z = by' dr D + v' - XD ) 1 (y 

a 2 s '■Nq a Jt} a N o a 

y'J = p 1 = p’£> this reduces to 
a a a a 


- 2 V' 
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V' D Jt 


- + (1 - A) 


2p r 2? (v - ) 

a a vy 2 H or 


“a " ^ A ' FiT ♦ 

The decision variable z^ of the linear-uncertain observer is thus 

a convex combination of the standardized energy of the projection of the 

modified input y onto F and a linear -term depending upon the cross correla- 


tion of y with the supposed signal y^. From Equation 3.24 it is seen that 
as the prior internal specification of signal becomes poor, that is the uncer- 


tainty becomes large relative to N^, the uncertainty parameter A approaches 
1 and a will behave like a pure energy observer. On the other hand as the 


observer a becomes increasingly certain of the correctness of the internal 


specification signal y^, X approaches zero and a behaves like a pure linear 
observer. When A lies between the extremes of 0 and 1, the decision variable 
z a ma Y b e seen to be a linear transformation of the squared radius g of the 
hyperspheroidal cylinder 


y ) *Z> (y + 
2 A a a J 




with center at - D% in F. 

2A a a 


The decision regions for a are depicted in Figure 3.2 (compare with 
Figure 3.1). Here the plane of the paper is determined by the true signal s 
and the prior specified signal y Q . The center of the region Rq lies on the 
line of y but on the opposite side of the origin. The modified input y is 
projected onto the subspace F . If the image of y lies outside the hypersphere 
then > ag + b (where 3 is the squared radius and a and b are appropriate 
constants) and observer a accepts the hypothesis that the input contains the 
signal. Otherwise, the image is in and the observer rejects that hypothesis. 
As A increases the center moves toward the origin and the direction (phase) 
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Fig. 3.2. Representation space o£ the linear-uncertain observer. The 

paper represents the plane in F a determined by the signal vector 
s and the observer’s mean representation of the signal y a . A 
criterion region R\ is determined .by a cutoff value $ on the decision 
axis which is isomorphic to the squared radius of a hyperspheroid 
in F. As A -*■ 1 the center moves on the line of p a to the origin 
0 of the space and the observer becomes a simple energy detector. 
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of y^ becomes irrelevant. As X -* 0 the center moves away from the tip 
of and the decision boundary becomes a perpendicular hyperplane at' the 
point of intersection with the line of y^. The observer's decision variable 
then behaves like the ideal observer's decision with variable z . but with 

i ’ 

V substituted for s. , 
a 

It is perhaps surprising that the linear-uncertain model of the observer's 
decision variable is a generalization of most previously proposed detection 
theory based models for human monaural auditory processing. This will be 
demonstrated in the next section. 

3.7 Special Cases of the Linear-Uncertain Model 

The models of this section are obtained from the fundamental Equation 3.27 
for the decision variable of the linear-uncertain observer. 

The linear 1 observer. When X = 0 in (3.27) 

(3.28) 

and the observer performs purely linear operations on the input waveform. 


z = — v'D (y - iy ) 
a Nq a a 5 2 ^ a 


If u = s and D -I, the linear observer is optimum since z = z of 
Equation 3.19. The observer is even optimum if I since according to 

Assumption 3. 3 D cannot degrade s. Thus, non-optimal performance of the linear 
observer must be due to the fact that y^ f s or be due to additional internal 
noise or criterion variability. 

The energy observer. When X = 1 in (3.27) 


z = 
a 


»' D J> 

N o 


(3.29) 


and therefore the decision variable of observer a is based only on the 
energy of the modified input passing through an idealized square band-pass 
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filter with bandwidth W^. Other forms of an energy model have been proposed. 
A more usual assumption is that it is the input x itself whose energy is 
found at the output of the filter, .i.e.. 


z 

a 


x'D x 
a 


N 

o 


(3.30) 


The two models are identical only when mean of the noise is the zero vector. 


When the mean is not zero, these energy models make different predictions for 
observer performance for some tasks, as will be seen in the next chapter. 

The envelope observer is a very special case of the 
Equation 3.29. Let sj_(t) be the Hilbert transformation of the signal wave- 
form s(t), 0 < t <T (cf. Hancock and Wintz, 1966). If |x(t) | is the 

instantaneous modulus of x(t), then in quite general terns the function 

B n (t) = (|s/t) | + [s x (t)l) 1/2 , 0 < t <T, (3.31) 

is the (instantaneous) envelope of s (t) . For example, if s (t) = Asin 2itut, 
then s_j_(t) = Acos 2v<i)t and E n (t) = A, as one might expect. The Hilbert 

transform is orthogonal to the original waveform, i.e., 

T 

| s(t)sj_(t) =0. (3.32) 

° 

If Sj_ is the vector of F corresponding to s_j_(t) , then (3.32) implies that 


s ' s_j_ “ 0. (3.33) 

Now if s (t) is a sinusoid, then s(t) and Sj_(t) differ only in phase. 

If observer a knows s but does- not know the phase of s then it may be shoicn 
that a should let 


D a = ss 7 1 | s | | + S4_sj/||sjl (3.34) 

in Equation 3.29 (cf. Wainstein and Zubako, 1962, Sec. 33). It. may also 
be shown that D is then a projection operator by the methods of Section 2.4. 
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The idempotency of D follows with help of (3.33). When the substitution 
of D of (3.34) into (3.29) is made it results in the decision variable of 
an envelope observer: 


(s'i/) 2 /l M| + (sjy ) 2 / 1 |sj | 


z = 
a 


N- 


(3.35) 


It is clear from (3.35) that z^ for the envelope observer is the squared 
radius of a circle lying in the plane of s and Sj_ in F with center at the 
origin. This is illustrated in Figure 3.3. The modified input y is projected 
onto the plane of s and If the length of the projection is greater than 

then it lies outside a circle z a ~ & in the criterion region R^. The 
circle defines a hypercylinder in F whose inside is R^. 

It has been pointed out elsewhere that the energy model and the envelope 
model make similar predictions in a variety of tasks (Green and Swets, 1966). 
In a general way this is apparent from the geometry for the two observers. 

The primary distinguishing characteristic of the envelope observer would 
appear to be his narrow and precise bandwidth as an energy observer. This 
conclusion is also born out by a comparison of the statistics of /tT with 
the statistics of the output of a narrow-band filter excited by Gaussian 
noise (Green and Swets, 1966, Section 6.5.2, and references there cited). 

3.8 The Noisy Linear-Uncertain Model 

An assumption of noise-free processing has been implicit in the presen- 
tation of the linear-uncertain model in the previous two sections. For any 


realizable system, e.g., a human processor, this is certainly a generous 
assumption. Two kinds of internal noise, have often been suggested. The 
first is that noise is added to the input x before it is processed. The 




Fig. 3.3. The representation space of an envelope observer. The paper 

represents the plane in F determined by the orthogonal vectors 
s and Sj_. A criterion region is determined by a cutoff value 
3 on the decision axis which is isomorphic- to the squared radius 
of a circle with center .at the origin in the plane. If y projects 
outside the circle in the region i?i, the observer accepts the 
hypothesis Hi. 
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second is that the response is a noisy transformation of the decision 
variable z^. Less often it has also been suggested that the observer may 
have a noisy memory for the specification of signal parameters relevant 
to good detection performance. A noisy memory would lead to a fluctua- 
tion in the processing of the input. The details of this latter source 
for degradation of observer performance will be considered first. 

Noisy memory models. It is possible that the observer-specified signal 

u varies in a random manner which -is not under observer control. If this 
a 

were then the case, we might suppose that observer specified signal vector 
could be adequately characterized as 

u* = u + e (3.36) 

a a 

where as before p Q is the mean of the prior distribution for s and e is 
a random error term, perhaps with a multivariate normal distribution density 
with mean vector 0 and dispersion matrix 

If the linear-uncertain observer is unaware of the variability in the 
mean of his prior distribution, then his decision variable would remain in 
the form given by z q in (3.27), but with y* of (3.36) replacing y^. It is 
important to realize that the distributional character of z^ is thereby 
changed as well. 

• Another possibility may be considered by assuming that the observer 
specifies the signal from memory on each trial, but that there is random 
error in this specification. The error could be due to changing uncertainty 
regarding signal parameters, for example. If the observer is aware of the 
fluctuation in his memory, then it may be shown that the observer should be 
a linear-quadratic processor of Equation 3.17, where is the dispersion 



- 42 - 


matrix of the memory signal vector whose mean is (presumably) the true 
signal s (Birdsall, 1960). This version of the noisy memory model 
differs formally from the linear-uncertain model only in that = s, and 
thus is another special case of the latter.* 

The sensory-noise model. If we. suppose that the sensory encoding produces 
random fluctuations in the input waveform, this could be represented by 
replacing the input x in the linear-uncertain model by 

x* = x + e. , (3.37) 

Birdsall (1960) has shown that a linear processor with Gaussian inter- 
nal noise added to the input has quite different psychometric functions from 
his noisy memory model mentioned above (for the special case where 2WT = 1) . . 
This is to be expected since x* is the effective input to the observer and 
would not change his mode of processing when e is normally distributed. 

The noisy decision Variable model. Perhaps the most common assumption 
made in one form or another is to consider that the decision variable itself 
is noisy; i.e., that 

z* = z + e (3.38) 

a a 

replaces the decision variable z^. Strangely enough, however, only until 
very recently have some of the implications of this plausible model been 
investigated (Wickelgren, 1968) . 

There are, of course, many other ways of degrading an observer's decision 
variable. The uncertainty parameter A could be made a random variable. The 
center frequency of the pass-band could fluctuate (a possibility recently 
investigated by Henning (1967)) '. The quality of the decision variable could 
vary, and so on. For the present, however, let us be content with noise 


Birdsall 's work (1960) provided much of the incentive for developing the 
. linear-uncertain model. 
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added to the input , unknown fluctuation in the observer’s specification 
in his prior mean (e^) , and perturbations of the decision variable itself 
(e 3 ). 

The decision variable of the noisy linear-uncertain observer is, then, 

+ ‘P’V* + e l } * (3.39) 

+ (1 - X) (u2 * e 2 ) ’O a [2(y + - Cv a + e 2 )] + e 3 - 

In order to be concrete in the following, we suppose that is normally 
distributed with mean zero and variance V^, and that e 2 and are each multi- 
variate normal with mean vector zero and dispersion matrices 


respectively, 
independent . 


N V N V 

„ W 0 V 2 T J * U V 3 T 

\ - 2 1 and \ ’ — T - 

Further, e^, e 2 , and the noise vector n are assumed mutually 
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MEASURES OF CONCORDANCE BETWEEN OBSERVERS 

4. 1 Introduction 

The previous chapter outlines the basic theory of a general model ■ 
for monaural auditory processing, the linear-uncertain model. This 
chapter investigates a variety of predictions of the model. Since the 
linear-uncertain model includes as special cases a number of previously 
proposed models for auditory processing, the methods of this chapter apply 
equally well to them. The derivation of the level of association between 
observers appears here for the first time. Possible methods for estimating 
parameters for .the linear- uncertain observer using non- parametric measures 
of association are considered. The psychometric function for the linear 
uncertain observer is approximated, and the relation between measures of 
performance and concordance are investigated. 

4.2 Observer Performance 

In a two-alternative task an observer's decision variable z may be con- 
sidered as having the distributions Fq(z) and F^(z) conditional upon Hq and 
Hj, respectively. If the observer uses the rule "say if z >_ z c> other- 
wise Hq" then the observer's hit rate would be 

CO 

P(H) = J d? 1 (z) (4.1) 

z 

c 

and his false-alarm rate would be 

oo 

P(F) = j dF Q (z) . •• (4.2) 

z 

An observer's receiver operating characteristic (ROC) curve is the set 
of ordered pairs [P(F),P(H)] parameterized by z £ (Peterson, Birdsall, and Fox, 
1954) . 


44 - 
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If the observer uses the rule (2) "say r if z = z ", where r 

c c c 

is a strictly increasing function of z^, the probabilities P(H) and P(F) re- 
main defined for each value of v but have no special significance except as 

c 

coordinates of the ROC curve. Clearly the ROC curves determined by z using 
either decision rule are identical. 

The ideal observer 1 may be characterized as that observer whose hit- 
rate is at least as great as the hit-rate of any other observer with the 
same false-alarm rate (Birdsall, 1966). This characterization is more general 
than that given in the last chapter where it is assumed that the signal is 
specified exactly. 

It is clear from the definitions that the area under the ROC curve 
for ideal observer must be at least as great as the area under the ROC curve 
for any other observer in the same task. An important result regarding the 
area was obtained by Green (1964a) . 

Lemma 4.1. If z^ and are two independent samples of an observer's 
decision variable, conditional upon and Hq, respectively, then the area 
under the ROC curve generated by the decision variable z (using either rule 
(1) or rule (2) above) is given by 

• ’WW d V I c ) = Pl >al » *«o>- C*- 3 > 

.00 

A proof is given in Green and Moses (1966) . I have generalized somewhat 
the language over the original statement of the result since Green interprets 
the probability on the right as the probability of making a correct response 
in a two-interval forced-choice task in which the observer knows that exactly 
one of the intervals contains signal-plus-noise and it will be shown that the 
ROC area is related to non-parametric measures of association between observers. 
4.S nonparametric Measures of Association 

We review two popular nonparametric measures of association which will 


be us’eful in the discussion of observer performance and concordance. 
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Following Kendall (1962), for two random variables z and z„ and 

a 8 

two independent, joint samples (z . ,z„.) and (z . ,z„.) of the variables, 

ax 81 aj Bl . 

define 

a.. = sgn(z . - z .) 
ij ai ay 

1 if z - z . > 0 

ai aj 

0 if z . - z . = 0 

ax aj 

*-1 if z . - z . < 0. 

ai aj 


Similarly, 


Then, tau is defined by 


b ij ■ - V' 


x = Corr(a. . ,b. .) . 

ij’ 


(4.4) 


(4.5) 


(4.6) 


If (z^z^) ^ as a cont f nuous bivariate distribution, 

• t = ^(a^by), (4.7) 

since in this case Ffa^) = ^(b^) = 0 and Var(a^) = Var(b_) = 1. 

Another way of looking at tau is in terms of the probability of agree- 
ment and disagreement of the sign of the difference between samples of the 
decision variables. With 


P(S) = P(a. .b. . > 0) 
ij J 


and 


(4.8) 


?(D) = P(a ij b. j < 0), 


it is readily shown that 


S ( a ij b ij) = P(S ^ " p ( D )- (4.9) 

Goodman and Kruskal (1954) were interested in indices of association 

for bivariate, ordered contingency tables. In this case the random variables 
are not continuous. The probability of a tie in- one or the other of the 
variables is 
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P(T) = PCa-.-b... s C 4 - 10 ) 

so that 

P(S) + P(D) + P(T) = 1. 

The Goodman-Kruskal coefficient gamma is based on only the untied pairs of 
the variables: 

Y s [P (S) - P(D)]/[1 . PCT)] (4.11) 

for continuous variables P(T) =0, so that y = t in this case. 

It is also possible to view gamma as a conditional correlation between 


signs. 

Proposition 4.1. 

y = Corr(a^ ,b^ | ^ 0, b^ f 0). (4.12) 

4. 4 The ROC Area as a Measure of Association Between Decision Variables s 
We may define the perfect decision variable by 

1 if H-. 


z = 
P 


(4.13) 


0 if H 


0 ‘ 


This is the experimenter's "decision variable" and, of course, does not 
depend upon the input x. 

Theorem 4.1. Let P. be the area under the ROC curve for observer a 

A 

with a continuous decision variable z , and y the Goodman-Kruskal coefficient 


between the perfect decision variable z^ and z a . Then, 

P A = (Y + l)/2. 


(4.14) 


Proof. Let ( z a > Z p) and C z ' ,Z p ^ two i n d e P en dent joint samples of 
the decision variables. Then, since the value of z^ specifies the hypothesis. 


p[(z - z r - ) (z - z' ) > 0] = P(z > z ). 
a o p p ctl ctO 
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The right hand side is P^ by Lemma 4.1. The left hand side is apparently 
P(S)/[1 - P(t)3. But, P (D) / [ 1 - P( T )] = 1 - P(S)/[1 - P(t)3, so that 
P(S)/[1 - P(t)] " (y + l)/2, completing the proof. 

The theorem's importance lies in the fact fiat it suggests a way to 
make reasonable estimates of the ROC area, even when the observed decision 
variable is not continuous. In Appendix I it is shown that if P(T ) is the 
probability of a tie in the observed decision variable z , an appropriate 
estimate of the a'ea under the ROC curve is given by 


est I’ A = [est Corrfa^jb^. b.^ ^ 0) + l]/2 


[ 


PCS) - P(D) 


A r - PCTjT p ) [1 - P (T p ) 3 


+ 1 


(4.15) 


where T p is the event b.^ ^ 0. A computing formula is also given. 

When both decision variables, z a and Zg, are considered continuous the 
estimate of tau is given by 


- est CorrCa..,^.) 


= PCS) - PCD) 

/[I - P(T a )][l - P(T e )3 


4.5 The Relation Between Parametric and Nonparametric Measures of Association 

The exact distribution theory for the noisy linear-uncertain model is 

extremely difficult and is apparently unsolved. (In the internal noise-free 

case, however, the distributions can be shown to be non-central chi-square.) 

However, the moments of z* are relatively less difficult to determine. This 

opens the possibility of trying to approximate the theoretical value of x 

between z^ and z^ using only the moments of the joint distribution of z^ 

and Zg. It is at least plausible that sample estimates of x could then be 

used to make estimtes of the unknown parameters entering into an observer's 

decision variable such as his uncertainty parameter X or bandwidth-time product 

a 

$ a = W^T. We shall call such a procedure ''nonparametric estimation". 
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Perhaps a somewhat less ambitious approach than nonparametric estimation 
would be to try to find experimental situations which would discriminate 
between alternative models using nearly any measure of concordance. If 
such experiments could be found, then statistics such as the linear corre- 
lations could serve at the theoretical level, and tau could serve at the 
empirical level. This latter procedure is the one followed here. It is 
worthwhile to give a more complete justification, however. 

Greiner's relation. The ordinary product -moment (linear) correlation 
between two decision variables z and z„ will be denoted by 

CL p 

"aB " Corr( V V ' 

If z and z have a bivariate normal distribution with correlation p _, 
a 8 aB 

then it is known (Kendall, 1962; Greiner, 1909) that 

^8 = SinCir V 2) ’ C4 * 17) 

When the stated assumptions are tenable this relation provides a consistent 

estimate of p „ from an estimate of t .. 

H a8 aB 

Griener's relation is a special case. of a more general approximation to 
T based on the joint moments for non-normal variation of random variables. 
Kendall (1949) assumed that the joint distribution may be closely approximated 
by a truncated Gram-Charlier series. With standardized moments defined by 


M ! . = E 


fr v^.1 1 pLi£!V I } \ 

j_ L [’'ar(z a )] 7l J [ [Var(z e )]' A J J 


and p = p a g = jijp Kendall's approximation* to t is given by 


aB tt 


2 . -1 
sm p + 


24ir(l-p 2 ) 


372 [Cy 40 + y 04 " 6K3p " 2p3) 


- 40 ^ + y ^ 3 - 6 p ) + 6 p (.^22 - 2 P 2 " 


(4.18) 


Kendall's original expression, his Equation (25), was given in terms of 
the cumulants of the joint distribution. 



so - 


If the decision variables are jointly normal the correction term (in 
braces) to Greiner's relation is zero. 

A small amount of empirical data presented by Kendall suggests that 
the unfortunate complexity of the correction term to Greiner's 
relation cannot be safely ignored for precise results in many situations. 
Unfortunately little more seems to be knovm regarding estimating p from x. 

For the linear-uncertain model the correction term is zero only if 
both observers are linear, i.e., -X - X 0 = 1 (X = 1 - X) . Otherwise the 

ct p 

correction term is not only non-zero, but contains moments of the physical 
waveforms not ordinarily measured, e„g., 

s 4 (t) dt, 

0 

for a constant signal waveform s(t). (Interesting results can be obtained 
when the signal is a sample of Gaussian noise, which however does not 
concern us here.) In the general case the correction term is extremely 
complicated for linear uncertain observers and it would require advanced 
computer techniques in non-linear estimation to obtain estimates of model 
parameters. I am forced to conclude that, although feasible, non-parametric 
estimation is not presently practical. 

The alternative mentioned above to non-parametric estimation depends 
upon an approximately monotone relation between x and p . An examination of 
Kendall's approximation to x is not very revealing in this regard. The 
difficulty is that factors which affect p may also affect the value of the . 
other joint moments in some unknown fashion. In our situation, however, 
we know that the association between linear-uncertain decision variables depends 
upon common elements of the noise process at the input. As was seen in 



- 51 - 


Chapter III the decision variable of the linear-uncertain observer is a linear 

transformation of the squared radius of a hyperspheroidal cylinder in the 

representation space. By assumption two observers share a common subspace 

in the frequency representation space. Thus, at least in the case where 

the cosine of the angle between p and p is not negative, the radii will 

ct P 

tend to increase and decrease together, i.e., will be to some extent concor- 
dant . Both x and p are measures of concordance in this sense (Kruskal , 1958) 
and therefore may be expected to be highly correlated with one another. 

4.6 The Correlation between Observers 

The results of this section make the implicit assumption that to a first 
order of approximation the joint distribution of the decision variables may 
be considered bivariate normal. Somewhat more accurate results for some 
purposes, with a corresponding increase' in technical difficulty, might be 
expected if marginal monotone transformations of the variables are made prior 
to the computation of the moments. Some possibilities for normalizing trans- 
formaitons are considered in Lamphier and Birdsall (I960). 

We consider the general case where the modified input to observer a is 
y a = y + and the input to observer 8 is y^ = y + s^. It is convenient 
to have the following definitions (we assume the observers’ filters overlap 

in F) : p = s (4.19) 

s a 


R a& T ag 2R ap /N 0 

E = R , d = 2E /N rt 
a aa a a 0 


(4.20) 

(4.21) 


Thus, for example, r means 2u'D „s„/ N„. the standardized cross-correla- 
r aSg aB 8 O’ 

tion in the joint subspace determined by = D between the prior mean 

vector p and signal vector s presented to observer 8. Of course, if s 
a 8 p 

is the null signal 0 , then r =0 and r =0 regardless of the value 

aS 3 S a S B 


of p^ or s q , respectively. 
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Theorem 4.2. The covariance between noisy linear-uncertain observers 


with decision variables z* and z* as given by Equation 3.39 is given by 


Cov(z*,z*) = A A.[5 + r ] + A A r 

a’ 6 a 3 ag s s Q J a 3 as Q 

— p 

+ A A.r a + A A r a . 
a 3 s 8 a 8 a8 
a 


Further, the variance of z* is given by 

Var(z*) * A 2 [6 (1 + V _) 2 + d (1 + V .)] +2~Ar (1 + V -7 

or a L a al al J a a as al 

a a 

+ P{6 V ,[2(1 + V .) + V J 
a a a2 l al a2 J 


(4.22) 


(4.23) 


♦ V 1 * V al \V + W s r as ’ V c2>* V a3- 

where S « mini 5 ,6 }, 5 = W T and a A = 1 - A. 

ag a 1 g a a 

The proof of the theorem is given in Appendix II. 

The linear correlation between noisy linear-uncertain observers is 
thus given by 


a*g* 


='Corr(z* z*) = 
a p 


c °v(z£,z*) 


[Var(z*)Var(z*)J 


1/2 


(4.24) 


There are, of course, many special cases which could be considered 
by assigning parameter values. There are over 2000 such cases for extreme 
values of parameters of which perhaps 100 might be considered "interesting" 
for some purpose. We shall consider only several of these interesting cases. 

Theorem 4. 3 . The necessary and sufficient conditions that two noisy 
linear uncertain observers have identical correlation on signal-plus-noise 
trials (H ) and noise-alone trials (H Q ) are that 

i) both observers are linear processors, i.e., = 0, and that 

ii) neither observer has 'a noisy memory-specified reference signal, 

1,e * ’ V a2 = V 82 = °' 
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Proof. Under H , s = $ Q - s so that r = d = d = d . 

lag s s R s Sft s 

qI • a p ct p 

J is obtained by inserting these values into (4.24). Likewise under 

H n , s - s a — 0, so that r = r =r = r Q = r=d = d = 0 ^ 

0 a g s s„ as„ as^ s 3 s„g s s„ 

a ® fo) a a ^ fl1 a fnf 
which when inserted into (4.24) gives p . Requiring that p_ = p . 

for any (non- trivial) values of y^ and y^ can then be seen to be equivalent 

to conditions (i) and (ii) . 

Corollary 4.3.1. Under conditions (i) and (ii) of the theorem the 
correlation is 


P = 


"ag 


"V * V * V a 3 HVl + V M ) + V B3 ]> 


1/2 


(4.25) 


and Greiner's relation holds so that 

. sin-lp. (4.26) 

7f 

It would be desirable to have the theorem stated in terms of x rather 
than p. The sufficiency of the conditions for equal x values on 11^ and H Q 
is given in the preceeding corollary. That equal x values_imply the conditions 
remains a reasonable conjecture. 

An interesting situation arises if one of the observers is an electronic 
energy detector which receives only noise at the input on both Hj and H Q 
trials. In this case = 1, V &1 = 0, V ^ = 0, and = 0 under 

both and H^. Further, we let 

Var. (z*) = Var(z*js = s) 
l v a v a* a ' 


and 


Var n (z*) = Var(z*|s = O') 
0 a a a 


•. (4.27) 


from (4.23). It is clear on inspection that Var^z*) is always greater than 

or equal to Var^(z*) regardless of model parameter -values . But, from (4.22) 

we have for both H, and 1L that Cov(z* z*) = A 6 This proves 

1 0 a’ 3 a ag K 
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Theorem 4.4. The correlation between a noisy linear-uncertain observer 
and an electronic energy detector which receives only noise is never less 
on noise-alone trials than on signal-plus- noise trials. 

The importance of the theorem stems from the fact that the opposite 
result is expected from the filter bank model discussed in Chapter I. The 
filter bank model assumes that the human' observer responds to the maximum 
of outputs from narrow-band filters in the filter bank. A relatively 
narrow band electronic filter will respond to components of the noise in a 
narrow frequency band around the center frequency of the signal and so 
should be only poorly correlated with the observer's response on noise-alone 
trials. On the other hand, when signal is presented to the observer the 
maximum output of his filters will nearly always occur in the passband of the 
energy detector and thus increase the communality of the two observers. 

It may also be argued that if the signal is presented to a narrow band 
energy detector on signal-plus-noise trials then both the filter bank model 
and the linear-uncertain model predict increased correlation on signal trials. 
Thus, that experiment does not provide the comparison between the models 
afforded by giving the energy detector only noise. An experiment using 
the special conditions of Theorem 4.4 would be necessary only if the linear- 
uncertain model can predict the results of Ahumada's experiment reported in 
Chapter I. We now show that it can. 

■ The squared correlation between the linear-uncertain observer and an 
energy detector with identical inputs on signal-plus-noise trials is 


P 



^ct (<S a3 




A r ] : 
a as J 


v ari(z*)C6 6 * V 


(4.28) 
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On noise-alone trials this reduces to 


(X 6 J 2 
2 a aB 

^0 “ 

Vsr O tz a )S B 


(4.29) 


Since 6^ = min(S a ,5g) and Var 0 (z*) does not depend upon 6^, p 2 is maximized 

by setting -6^ = 6^. For p 2 it is clear on inspection that the- expression • 

will be maximized by talcing 6„ < 6 . So we may set 6 „ = 6„. To investi- 

gate further we differentiate the logarithm of with respect to 6^ (ignoring 

the constant Var. (2*) ) : 

1 a 


^{2 MX 0 (6 p + d) + X^ -** S{J + d s D> 

2X 


X (6 a + d ) + A r 6 g + d s 

a B s a as 


By setting the expression equal to zero and manipulating, we obtain 

X 

5„ = ■— r - d 
B X as s 

a . . 


as the value of <5„ which maximizes p? (as long as the right hand term is 

p i 

less than 61. If r = d , as in the noisy memory model, then the solution 
a as s J 

is reasonable if the linear-uncertain observer has somewhat less memory noise 

M„ than N n , since X /X = 

0 0 a a 0 0 

4. 7 The performance of Linear-uncertain Observers 

It has been found empirically that the ROC curves of human observers 
often appear rather like straight lines when plotted on double-probability 
paper, that is when the coordinates P(H) and P(F) are transformed to deviation 
scores for a standard normal distribution. Such ROC curves would be exactly 
straight when there exis ts a monotone transformation of z^ such that the 
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conditional distributions Fq[iJj(z^)] and F^[tf»(z^)] are both normal. It has 
been shown that in a task with signal specified exactly, strictly speaking, 
there exists no such transformation ift unless the observer is equivalent to 
a linear processor of the input (Wilcox, 1967). 

To a first order of approximation, however, we may take the observer's 
decision variable z^ as being normally distributed with mean i?(z a ) and variance 
VCz^). Generally the means and variances will be different on and H Q trials. 

Theorem 4.5. Let an observer have a decision variable z which is distri- 
buted normally with mean y^ and variance a 2 on trials and with mean y^ and 
variance o 2 on trials. Then, the area under the ROC curve generated by 


z is given by 




P A = Kt) dt. 


(4.30) 


where $(t) is the standard normal density function and 


^ i - V 
(a 2 + o 2 ] i/2 


(4.31) 


The proof is made straight-forward by inserting the appropriate normal 
densities into Equation 4.3 and making a simple change of variables in order 
to reverse the order of integration. 

It may be noted that if the observer is the ideal observer we find that 


pi 

Z V NT 


= d« = d 1 / 2 
s s 


(4.32) 


as is well known. 


We shall call d^ obtained from (4.30) the sensitivity index That is 


if P, is known 
A 


* Jeffress (1967) has defined index d which is d' of Equation 4.31. I prefer 

z z 

the more general definition of d ' of Equation 4.33, of which is a special 


case. 
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d* = »^“ X (P A ) (4 .33) 

where $ is the standard normal distribution function. 

Theorem 4,6. The sensitivity index of the noisy linear -uncertain 
obeservec with decision variable z* is given approximately by 


d z* " 


J2[\ d /2 + A r ] 
1 a s- a as 


[Var 0 (z*) + Var 1 (z*)] 
where Var^ and Var^ are given by (4.27). 


V 2 


The theorem follows from the fact, proved in Appendix II, that 


(4.34) 


£(z*) - \ [fi (1 + V ,) + d /2] 
' a J a 1 a v al' s 1 
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(4.35) 


The efficiency n a of an observer a is defined as a ratio of signal 


energies (Tanner and Birdsall, 1958): 


n = E /E 
a z ' s 
a 


(4.36) 


where E is the signal energy necessary for the ideal observer to perform 
z a 

at the same overall level as observer a. IVe shall take this to mean that 

E is the signal energy necessary for the area under the ROC curve of the 
a 

ideal observer to be equal to the area under the ROC curve of an observer 
a who has signal energy E $ in his detection task. Then, for the case in which 
signal is specified exactly, E may be found from 


d' = /2E~7 nT 

z z ' 0 

a a 


Thus, 


n » (d* /d ') • 
a K z s J 
a 


(4.37) 
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Plots of P(C) (the probability of a correct response inaa two-interval 

forced-choice task) , P , log n , or log (d 1 ) 2 versus log d are examples of 

a 

psychometric functions. The first is most commonly used, although the latter 

two are definitely superior for comparing processing models. 

Birdsall (1960) has investigated the shape of psychometric functions 

of noisy-memory observer in the case 2W^T = 1. McGill (1967) has obtained 

the psychometric functions for the internal-noise -free energy model for 

several values of 2W T. 

a 

4i8 The Relation Between Performance and Concordance 

Theorem 4. 7. Let n a be the efficiency of a linear observer a with no 
reference signal noise (V ^ = 0) and let p be the correlation between a and 
the ideal observer. Then, 


n a = p 2 . (4.38) 

Proof . p is given by (4.25) with d = d , V D1 = = 0. n is 

Sg s pi po 

given by the square of (4.34) divided by and with X^ = 0, V ' ^ ~ 0. 

The equality (4.38) follows. • 

Apparently the efficiency of the noisy linear-uncertain observer in the 
general case cannot be expressed solely in terms of the linear correlation 
with the ideal observer. 

Theorem 4.8. Let p. = Corr(z*,z* |H.) , V . = Var(z*]H.), i = 0,1. 

1 ct p x ai ot x 


Then 

Pl /v“Tv 


1' v al , 81 " p 0 /V a0 V 50 “ X p 


a *“ 8 


The theorem is proved by writing out (4.39) in detail for noisy linear- 

uncertain observers. The theorem provides a way to predict inter-observer 

correlation differences from the observer sensitivity indices and observer 

correlations with the ideal observer. First note that V _/V . = v is the 

a0 al a 
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slope of the ROC curve plotted on double probability paper, as long as 
the straight line approximation is good (Green and Swets, 1966, Ch. 3). By 


dividing both sides of (4.39) by ATTv T we obtain 

ccl ctO 


p i VVe - 


Arp^ 


-< 
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+ , 
)»» i* 
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1/2 X 

d* + a 
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L 2 J 


(4.40) 


Since ,d^, , v and v_ may be estimated from performance data, the remaining 
a p a 

unknowns are a single parameter in the form X /Af for each observer. 

Suppose that observer p is the ideal observer, which may be simulated by a 

cross-correlator when signal is specified exactly. Then = 0, = 1 

and d' = AT~. In this case (4.40) can be written in the form 

z p s 


C 1 - e </^ 


AT“ AF 

al s 


(4.41) 


which provides an estimate of X^/AT^ from the correlations of observer a with 
the ideal observer. These estimates may be inserted into (4.40) to provide 
a prediction of the quantity - p^/vA^ for each pair of observers. 
Obviously such predictions should be considered only as good as the estimates 


of v and v„. 
a 3 



CHAPTER V 


AN EXPERIMENTAL INVESTIGATION OF INTER-OBSERVER CONCORDANCE 
5.1 Predictions of the Models 

An experiment was conducted to examine the effects of several input 
conditions on the level of concordance between observers in a single-inter- 
val detection task. The observers were three human observers and two 
electronic devices. The first device was the "ideal observer" (CC) , that 
is, it computed the cross-correlation of the signal waveform with the noise 
waveform present in the presentation interval. The second device, the 
energy detector (ED), computed the energy of the filtered noise waveforms in the 
presentation interval. The human observers gave responses indicating 
their confidence that signal was present in the presentation interval . 

A nonparametric measure of concordance was computed on both and trials 
between all pairs of the five observers in each of the conditions of the 
experiment . 

As discussed in Chapter IV, several models of monaural auditory pro- 
cessing make different predictions for the levels of concordance between 
observers. The major predictions for these experiments are: 

-i) Linear observers with non memory-specified reference-signal noise 
have equal correlations on and trials, 
ii) If the noisy linear-uncertain observer has a linear component in 

his decision variable (A ^ 1) then the absolute value of the corre- 
lation between the observer and the cross-correlator (with a noise-only 
input) is not zero and is greater on trials than on trials. 
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iii) Under the same conditions as (ii) the insertion of CW (carrier wave) 
into the noise cannot decrease the correlation between the observer 
and the cross-correlator (with noise-only input) . 

iv) The linear correlation between a filter-bank observer and the 
cross -correlator is zero on both and trials, 

v) The correlation between a noisy linear-uncertain observer and the energy 
detector (with a noise-only input) must be greater on trials 
than on trials . 

vi) The correlation between filter-bank observers and the energy 
detector should be greater on trials than on Hq trials, 

vii) Insertion of a continuous sinusoid (CW) into the noise background 
with the same frequency and phase as the signal should increase 
the correlation between filter-bank observers . 

5 . 2 Method 

The experiment was conducted at the Sensory Intelligence Laboratory, 

The University of Michigan. It involved the presentation of a 1000 Hz tone 
pulse, the signal, in a background of Gaussian noise. Three observers listened 
monaurally (one ear) to identical inputs through earphones. On each trial 
a random selector determined whether signal was to be present (H^) or not 
(Hq) in the noise background. The observers were asked to report on each 
trial their confidence that the hypothesis was correct. During the presen- 
tation interval the noise waveform was also gated to two electronic devices. 

The first device computed the cross -correlation of the noise with the signal. 

The second device computed the energy of the noise in a 50 cycle band centered 
at the signal frequency. The outputs of these devices were compared with 
observer reports on and Hq trials separately. 
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Apparatus. The experiment was programmed on a system known as N.P. 
Psytar which has been extensively described elsewhere (Green, Birdsall, and 
Tanner, 1957) . N.P. Psytar contains a white noise source, ocillators for 
producing tones, an automatic random number generator, amplifiers, attenua- 
tors, audio gates, and the necessary logic and timing circuits to completely 
automate the presentation of acoustic waveforms and record observer responses 
N.P. Psytar was augmented in the present experiment by two analog multipliers 
with outputs fed to gatable analog integrators, a digital voltmeter (Hewlitt 
Packard) with printed output (Hewlett Packard) and response sliders one 
foot long attached by a dial cord to a ten- turn linear potentiometer. The 
trial type, or was punched automatically on computer cards. 

A block diagram of the analog multiplier and integrator circuits is 
shown in Figure 5.1. The signal and noise sources shown were also used to 
generate the inputs to the earphones. The analog gates to the integrators 
were closed (ungrounded) simultaneously with the presentation interval. The 
gates across the integrator capacitors were opened at the onset of the presen 
tation interval and remained open until the offset of the observers' response 
interval during which the digital voltmeter recorded the stored charge on 
the capacitors. 

Following the response interval the digital voltmeter also recorded 
the position of the sliders by measuring the voltage drop across the slider 
potentiometers. The input to the digital voltmeter was determined by a 
stepping relay which was reset after each trial. The voltmeter was allowed 
250 ms to stabilize on each reading before the result was printed. 
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Fig. 5.1. Block diagram of Electronic Observers. CA is a current amplifier, 
DA a differential amplifier, I an operational amplifier in an 
integrator configuration, and G is an analog gate circuit. The 
differential output voltage of the multiplier is proportional to 
the product of the input currents. 
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The offset voltages of the integrators varied slightly from day to 
day but were quite stable during each two hour experimental session. 

The narrow-band filter used in the energy detector was a single 
tuned passive filter with center frequency at 1000 Hz and 3 db power 
points at 974 Hz and 1025 Hz. Therefore, the 3 db bandwidth was 
W 3db = 51 Hz and the equivalent square band-pass was approximately 
W = U/2)W 3db = 80 Hz'. 

Signal and noise levels. The detectability of a signal specified 
exactly in a background of white, Gaussian noise is appropriately measured 
by the index d = 2E/N Q which is the square of the sensitivity index 
of an optimum observer in the task, E is the signal energy and Nq is 
the noise power per unit bandwidth. The method used to measure the 
ratio is described in Green and Swets (1966, Appendix III) . In the 
conditions with a signal duration of 100 ms, d was equal to 28.8, and for 
signal durations of 40 ms, d = 28.5, In conditions with CW added to 
the background noise the CW had a level of 14 db above the average noise power 
density N^. The CW does not affect the computation of d since it is ignored 
by the optimum observer. 

Subjects. Three female undergraduate students served as observers in 
the experiment. Observer 1 (OB 1) had served in a previous experiment which 
required confidence judgments. OB 2 was also an experienced observer, but 
had not previously used the confidence mode of response. OB 3 had had no 
previous experience as an observer in psychoacoustic experiments. The 
observers were paid at a base-rate per hour commensurate with their previous 
experience. In addition, bonus points were computed on each trial and 
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were converted to money in such a way that for average performance an 
observer could, in effect, increase her hourly wage by 50%. However, all 
bonus payments were made after the completion of the experiment contingent 
upon continued attendance.. 

Payoffs. The bonus points were computed on each trial according to 
the . following foimula. 

trials: B = 1 - (1 - r^) ; 

H q trials: B = (1 - r^) , 

where B is the number of bonus points and r is the confidence rating as 
a proportion of distance along the slider scale. With this payoff scheme 
observers should maximize their subjective expected bonus by reporting their 
"true" subjective probability that signal was present on that trial. The 
bonus points for a day’s session were usually reported to the observers 
on the following day before their next session. 

Preliminary training. The observers were given four sessions each 
with 500 to 700 trials in which two response buttons were used. The 
responses were labeled "Yes" and "No" regarding signal occurrence. The 
two-button sessions were followed by eleven sessions in which four response 
buttons were used. The observers were instructed to use the buttons to 
indicate (1) "I am quite sure signal was presented", [2) "I am not certain, 
but I think the signal was presented", (3) "I am not certain, but I think 
the signal was not presented", and (4) "I am quite .certain that signal was 
not presented". In order to acquaint the observers with the way bonus points 
would be computed when they used the slider a modified bonus scheme was 
used in the four-button sessions. Also, the observers were told to keep 
in mind that later on in the experiment they would be using a continuous 
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slider to indicate their confidence and that they should think of the 
four buttons as being approximately evenly spaced along the scale of the 
slider. These sessions continued until it was ascertained that the a 
posteriori probability of given a response was a monotone function of 
the button^ number for each observer for four 'days in a row. The observers 
had some experience with each of the four experimental conditions described 
below. Following the four button sessions, the observers were given two 
sessions with the slider response before the data reported here were 
obtained. 

9 

Procedure. A trial, from an observer's point of view, consisted 
of four intervals in time, each marked by a separate neon indicator. The 
duration of a trial was about 5.6 seconds. The first 500 ms was a "get 
ready" period. The presentation interval of 100 ms in conditions I and III 
or 40 ms in conditions II and IV immediately followed. A two second response 
interval followed during which the observer was to position the slider to 
indicate her confidence that signal was presented^ After eleven" slider 
sessions the' observers complained that the response interval was too long. 

For the remaining five sessions the response interval was decreased to 1 
sec so that the total trial duration became about 4.6 sec. 

Each day's session consisted of five to seven blocks of trials. Each 
block consisted of 100 trials after which the observers were given a one to 
two minute rest. Halfway through each session observers were given a ten 
to fifteen minute break. Each session lasted approximately 1 1/2 hours. 

Sixteen experimental sessions were conducted, using the slider response. 
The first two sessions were for practice (from the experimenter's point of 

* Use of a mechanical analog to a continuous rating response has been used 
previously by Watson, Rilling, and Bourbon (1964). 
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view) . The data of two other sessions were omitted from analysis because 
of equipment problems. The remaining twelve sessions were divided into 
three sessions each for four experimental conditions. 

Condition I had a presentation interval of 100 rns and no continuous 
sinusoid (CW) added to the background noise. Condition II had' a 40 ms 
presentation interval with CW. Condition III had T = 100 ms and CW. 
Condition IV had T = 40 ms and CW. The three sessions of a condition 
were run on sequential days except- for session 3 of condition I which was 
the last of the experimental sessions. 

Preliminary data analysis. The trial -by-trial responses of each 
observer and device, which were' read by the digital voltmeter, were punched 
on computer cards and analysed by a preliminary data analysis program on 
an IBM 7090 computer. This program converted the response values to stan- 
dard scores for each observer and device. These scores were separated 
into two groups corresponding to trials and trials, respectively. 

The scores for the three sessions for each condition were then merged. The 
program generated many cut-points for the data and determined the frequency 
distribution of scores for each observer and device. From these distribu- 
tions it could be determined which cut-points would give approximately ten 
equally probable categories of response. The program then obtained the ten 
joint frequency distributions for each'pair of observers and devices for 
each of the four conditions on and trials separately. Thus, a total 
of 80 = 10 * 8 joint frequency tables were obtained in this way. The 
joint frequency tables were used to deteimine the level of concordance 
between observers and devices. 
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In order to obtain empirical ROC curves the cut -points were determined 
which would give approximately ten equally probable categories of response 
regardless of or H^. The program then determined the 2 * m (m = 9 or 
10) frequency table of hypothesis (H^ or H^) versus response category for each 
observer and each device for each of the four conditions. From these data 
tables the empirical ROC curves, the area estimate and several other' perfor- 
mance measures could be determined. 

5 . 3 Eesults 

Observer performance. The ROC curves for each of the four conditions 
are presented separately for each observer in Figures 5.2, 5.3 and 5.4. 

Each curve is based on approximately 1,800 trials. The ROC curves for the 
two electronic devices are not shown since they fell nearly perfectly along 
the chance line as expected, (Both devices, it will be recalled, were 
presented only the noise waveform sample on each trial.) 

The shape of the observer ROC curves indicate that they would be only 
poorly approximated by a straight line on double probability paper. If any 
generalization can be made it would be that the curves differ from straight 
lines with unity slope by having a slightly smaller slope and are somewhat 
concave toward the chance line. Also the curves appear quite similar in 
shape across conditions. This conclusion is most pronounced for 0B3 who 
had the highest efficiency in all conditions. 

The performance of all observers is better when CW is present. Observers 
1 and 3 show better performance with T = 40 ms when no CW is present, but 
show better performance at .T = 100 ms when CW is present. There appears 
to be little effect of duration on the performance of ‘observer 2. 
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Fig. 5.2. Empirical ROC curves for observer 1. The curves are based on 
1,800 trials each and are plotted on double normal -probability 
paper. The line of chance performance (P(H) = P(F)') in the 
task runs from the lower left-hand comer to the upper right 
of the figure. 
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Fig. 5.3. Empirical ROC curves for observer 2. 
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Fig. 5.4. Empirical ROC curves fir observer 3. 
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A comparison of measures of observer performance is given in Table 5.1 
for each observer under each experimental condition. The estimate of the 
area under the ROC curve based on the conditional correlation between signs, 
est P^, was computed according to Equation A1.17 in Appendix I. Trap P^ 
is the estimate of the area based on using the trapezoidal rule for integra- 
tion and was computed using Equation A1.18. d' z is the sensitivity index 
computed from est P, using Equation 4.33. d* is an index of performance 
which may be defined as 2$> -1 (P), where P is the hit-rate at which the ROC 
curve crosses the negative diagonal. Finally n is the efficiency of an 
observer computed from d^ and d g = 2 E/Nq according to Equation 4.37. 

For all observers and conditions trap ? A is a little less than est P^ as 
it should be. For the fairly large number of cut -points used here the trape- 
zoidal estimate is only smaller by about 1%. 

The values of d' are generally somewhat larger than d' This constitutes 
partial confirmation of the generalization that the ROC curves are concave 
downward . 

There is an interaction effect of CW with duration. The efficiencies 
are smaller for T = 100 ms than for T = 40 ms when no CW is present. 

This inequality is reversed when CW is present. The introduction of CW also 
substantially improves the performance of the observers. 

It was hypothesized that the use of the continuous rating response 
might have a depressive effect on observer performance. If so, the obser- 
vers should have shown somewhat better performance in the preliminary training 
sessions where only four response buttons were used. No such consistent 
differences were apparent using the index d^. 
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TABLE 5.1 

A COMPARISON OF MEASURES OF OBSERVER PERFORMANCE 

Condition 


I 

II 

III 

IV 


cw 


cw 


Observer 1 

100 

40 

100 

40 

est P A 

.818 

.850 

.953 

.930 

trap P A 

.807 

.839 

.945 

.923 

d z 

1.28 ' 

1.47 

2.37 

2.08 

d ; 

1.31 

1.45 

2.44 

2.32 

n 

.057 

.076 

.195 

.152 

Observer 2 

est P. 
A 

.841 

.855 

; .940 

.936 

trap P A 

.830 

.844 

.931 

.927 

d'* 

2 

1.41 

1.50 

2.20 

2.16 

d* 

1.51 

1.61 

2.23 

2.26 

e 

n 

.069 

.079 

.168 

.164 

Observer 3 

est P A 

.861 

.897 

.960 

.937 

trap P A 

.851 

.886 

.953 

.929 

d z 

1.53 

1.79 

2.48 

2.16 

d* 

1.54 

1.86 

2.54 

2.31 

e 

n 

.081 

.113 

.213 

.164 




- 74 - 


Observer Concordance . Values of tau, corrected for ties, were 
computed from Equation A1.13 using the 10 x 10 joint frequency tables 
described above- The observed values of tau between the cross-correlation 
and the other observers are presented in Table 5.2 for each condition. 

Maximum confidence intervals for each value were computed (Kendall, 1962, 

Eq, 4.12). In Table 5,2 all but three values have 50% confidence intervals 
which include zero. For the remaining three values 60% confidence intervals 
include zero. These intervals are generally considered quite conservative 
so that it may still be worthwhile to look for systematic trends in the 
data. 

There seems to be a very small but persistent positive concordance between 
the cross-correlator and the energy detector. An investigation of the 
noise source showed a slight skewness in the noise distribution around the 
zero amplitude value. This non-linearity in the noise waveform appeared 
to be the most likely cause of the slight degree of correlation found. 

For observers 2 and 3 the values of tau are a little higher on 
trials than when CW is not present. Even this inequality is reversed 
for 0B2 when CW is present. Thus there appears to be no systematic basis 
upon which to accept hypothesis (ii) in Section 5.1. Furthermore, there 
appears to be no clear increase in the correlations when CW is present 
as predicted in hypothesis (iii) . Thus, we may conclude that the data pro- 
vided by the correlations between the cross-correlator and the human observers 
can be explained either by the noisy linear-uncertain model with no linear 
component, i.e., a noisy energy detector, or by the filter bank model. 
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TABLE 5.2 

observed tau values between the -cross correlator 

AND. 


THE OTHER OBSERVERS 


Condition 


CC vs ED 

i ac 


CC vs OB^ 


CC vs OB 2 


CC vs 0B 3 


H 1 


H 1 

H, 


H 1 


H 1 

H, 


I 

cw 

II 

III 

IV 

CW 

100 

• 40 

100 

40 

-0014 

0838* 

. 0242 

0085 

0082 

0225 

0181 

0162 

-0131 

0052 

-0119 

-0197 

-0597* 

-0056 

0080 

0082 

0160 

0418* 

-0086 

-0086 

-0380* 

-0160 

-0273 

-0273 

0132 

0247 

-0034 

0288 

-0045 

0015 

0094 

0293 


* These values are significantly different from zero at the 
0.5 level of confidence but not at p < 0.4. 

Decimal points 'Emitted. 




The remaining tau values were converted to rough estimates of the 
linear correlation using Greiner's relation (Equation 4.17). The 50% 
confidence intervals for the tau values were also converted to correlation 
confidence intervals. The correlations between the energy detector and 
the human observers are presented in Table 5.3. Over half the correlations 
at T.= 40 ms fail to be significantly different from zero at the 0.5 
level of confidence, while none of the correlations for T = 100 ms are 
zero at this level. Thus, there is a pronounced decrease in the correla- 
tions for the shorter durations regardless of whether or not CW is present. 

With no CW the correlations for trials are greater than for H Q 
trials. The differences are significnat for 0B1, but not for 0B2 or 0B3. 
There appear to be no other consistent differences between the correlations 
which are also reliable. 

No specific hypothesis was offered in Section 5.1 to attempt to predict 
the effect of duration. This is because such predictions can only be made 
for the linear uncertain model by assuming specific parameter values . 

This topic is further discussed below. The weak evidence that the correla- 
tions are greater ori than on H Q trials, at least when no CIV is present, 
tends to reject the noisy linear-uncertain model in favor of the filter-bank 
model according to predictions (v) and (vi) . 

The inter- observer correlations and their 50% confidence intervals are 
presented in Table 5.4. The relations between the correlations are highly 
stable, regardless of the pair of observers considered. With a single minor 
exception (OBI vs 0B3, Condition II vs IV) ' correlations on trials indicate 
lower correlations on 11^ trials within comparisons for an observer pair. The 
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TABLE 5.3 

CORRELATIONS BETWEEN THE ENERGY DETECTOR 
AND THE HUMAN OBSERVERS 1 " 




Condition 




I 


II 

III 


IV 


cw 



cw 


100 


o 

100 


40 ' 


ED vs OBI 

H 1 P 

272 

129 

325 

-017* 

Cl 

(220,323) 

(079,179) 

(279,369) 

(-068,033) 

H o p 

151 

026* 

133 

045* 

Cl 

(101,200) 

(-024,075) 

(085,180) 

(-005,095) 

ED vs 0B2 

Hj P 

204 

056 

166 

-035* 

Cl 

(150,256) 

(006,106) 

(118,214) 

(-085,016) 

H 0 p‘ 

• 166 

-036* 

094 

057 

Cl 

(116,216) 

(-085,013) 

(046,141) 

(007,107) 

ED vs OB3 

h 2 i 

228 

094 

101 

001* 

Cl 

(175,279) 

(044,144) 

(052,149) 

(-049,051) 

H„ P 

217 

048 

164 

027* 

Cl 

(168,265) 

(-001,098) 

(117,211) 

. (-023,077) 


f Decimal points ommitted. Each estimate based on approximately 

1800 trials. 50% confidence intervals (Cl) are given in parentheses. 

* N.S. for p 0.5, 
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differences between correlations on and trials with CW are all quite 
significant (p .05). The differences when CW is present are smaller 
but there are no violations of sign. Thus, the hypothesis (i) of linear 
observers with no reference signal noise is strongly rejected. 

The Hj-Hq correlation differences are larger with CW than with Gtf. This 
effect appears primarily due to a decrease in correlations on H trials. 
However, with a single exception (the same as before) correlations are 
higher with CW present for corresponding durations. The decrease on 
trials present a serious difficulty for the filter-bank model according 
to hypothesis (vii). The CW signal lies in the center of the signal spec- 
trum by construction. It could be argued that CW should increase the 
correlations more on trials than on trials since the maximum narrow 
filter output will be more uniformly distributed in frequency when no CW 
is present. However, it is inconceivable how introduction of CW could 
decrease the correlation on trials. 

It may be inquired whether the relation between inter-observer corre- 
lations can be predicted from observer sensitivity measure as described in 
Section 4.8. The procedure described there required non-zero correlations 
with the cross -correlator which is doubtful considering the present data. 

It was decided, therefore, not to attempt the prediction. However, the 
equation (4.39) derived for linear-uncertain observer suggests that the 
weighted difference for correlations on and trials should be related 
to the weighted sum of the sensitivity indices. Examination of the present 
data shows that with a single exception (OB2 vs OB3, Condition III), 

P a8^ " p aB^ * s a mono ' tone decreasing function of the sum d’ + d’ across 

Z a ’ Z B 



TABLE 5.4 


INTER-OBSERVER CORRELATIONS'* 


Condition 



I 

• II 

III 

IV 


cw 


CW 



100. 

40 

100 

40 

OBI vs 0B2 


* 



H 1 P 

686 

605 

472 

519 

Cl 

(650,719) 

(567,640) 

(431,512) 

(477,558) 

H 0 > 

313 

326 

381 

328 

Cl 

(265,359) 

(279,371) 

(337,423) 

(281,373) 

OBI vs 0B3 





H x p 

736 

691 

498 

597 

Cl 

(703,765) 

(658,722) 

(458,537) 

(559,633) 

H 0 P 

337 

400 

437 

353 

Cl 

(290,383) 

(355,443) 

(395,478) 

(307,398) 

0B2 vs 0B3 




‘ 

Hj_ P 

660 

671 

513 

495 

Cl 

(623,695) 

(636,703) 

(473,551) 

(453,536) 


250 

291 

382 

373 

Cl 

(201,298) 

(244,337) 

(338,424) 

(327,417) 


* Decimal points omitted. Each estimate based on approximately 

1800 trials. 50% confidence intervals (Cl)" are given parentheses. 
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conditions for each pair of obervers. I can give no interpretation to this 
result . 

5 . 4 Discussion 

The noisy linear-uncertain model was developed and proposed as a 
plausible alternative to the filter-bank model. The linear component in 
the linear uncertain model is a necessary ingredient in order to account ■ 
for Ahumada's thesis findings as was seen in Chapter IV. Thus, for the 
linear uncertain model to remain a viable alternative it is of importance 
to consider the first three predictions of Section 5,1. The first, that 
correlations between observers are equal on and H Q trials, is unequivacably 
rejected by the data. The conclusion is that human observers do not perform 
simple linear operations on the input; at least there must be noise in the 
memory-specified reference signal. The first part of the conclusion has 
been verified repeatedly using indirect comparisons. However, the possibility 
that the observer performs noisy linear operations on the input has never 
previously been investigated using empirical comparisons. 

The predictions • (ii) and (iii) are an attempt to face directly the 
possibility of a linear component in the observer's decision variable regard- 
less of whether cr not there is reference-signal noise. There appears to 
be little evidence of correlation with the cross -correlator at all. Thus, 
the further questions of whether the correlation is greater on trials 
than on trials or \yhether the correlation is increased by insertion of 
CW into the noise are irrelevant. A rejection of the hypothesis of a linear 
component in the decision variable serves to reject the noisy linear-uncertain 
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model as well(in connection with Ahumada's findings). An attempt can be 

made to explain the small correlations with the cross -correlator by assuming 

that the linear component is present but has a small weight, i.e., that 

A « X , However, this is not sufficient to save the model. As was seen 
a a . 

at the end of Section 4,7 X ' should be somewhat greater than A^ for the 
explanation of Ahumada's finding to be reasonable. Another possibility 
for saving the linear component is to assume that the reference- signal noise 
i s quite large. But cannot be very large before the correlations 
among observers are depressed. However, there appears to be no way to 
positively reject this latter assumption with the present' data. 

The linear-uncertain model, temporarily preserved by the assumption of 
considerable reference signal noise, must still meet the fifth prediction, 
namely, greater correlation on H Q trials than on trials with the energy 
detector. No significant differences were found in this direction although 
some differences in the opposite direction were weakly significant. Thus, 
the correlations with the energy detector also provide some evidence for 
rejecting the linear-uncertain model. 

The evidence provided by the correlations with the cross-correlator, 
the energy detector and Ahumada's variable bandwidth energy detector lead 
to the conclusion that the noisy linear-uncertain model, including its special 
cases, is rejected as an adequate model of human monaural auditory processing. 

The filter-bank model survives predictions (iv) and (vi) that observer 
correlation is zero with the cross-correlator and that the correlations with 
the energy detector are greater on trials than on trials, respectively. 
However, the final prediction, that inter-observer correlations- should increase 
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with the insertion of CW into the background noise, is not verified. The 
insertion of CW does cause a slight increase on trials but also a 
pronounced decrease on ^ trials/ Although this is the first clear evi- 
dence against the filter-bank model it cannot be taken lightly. The pre- 
diction is strong even though formal development of the filter-bank model 
has - not been made. 

It might be argued that the decrease in correlations with the insertion 
o f CW is an artifact caused by the improved performance of the observers . 

Since they were asked to give their confidence that they would be right if 
they had said H^, one would expect a greater concentration of responses near 
the ends of the slider in CW conditions. This concentration could cause 
a reduction in the association between the observers' responses. An examina- 
tion of the response distributions across the slider scale did show increased 
grouping of responses towards the ends of the slider. However, if the drop 
in association is caused by this then tau computed on 2x2 joint response 
tables with marginal cut-points at the medians should be relatively unaffected 
or increase (according to prediction (vii)). These tau values were computed, 
as usual, correcting for ties. With CW there was a slight increase in the 
range of inter- observer correlations between and II Q trials. In the CW 
conditions, rather than an increase, both correlations on and trials 
decreased slightly with the range staying approximately the same as when 
computed on the 10 x 10 joint frequency tables. Thus, it is concluded that 
the result is not an artifact of response grouping. 

There axe two experimental results which elude explanation by the models 
considered so far: (a) the decrease in observer concordance with the energy 
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detector when the signal duration is decreased, and (b) the lower inter- 
observer concordance on trials when CW is present. . 

The first can be given an ad hoc explanation in terms of a modification 
of the simple energy or envelope model. The modification is described by 
Jeffress (1967) . His "leaky integrator model" has a narrow-band filter 
followed by a square-law detector (or perhaps a linear rectifier) . The 
output is exponentially weighted with a fixed rate of decay and integrated 
continuously in time. It is presumed that the observer’s decision variable- 
is the value of the integral at the termination of the signal presentation 
interval. Since the rate of decay is constant (about 100 ms according to 
Jeffress) short signal presentation intervals will cause part of the noise 
waveform not in the presentation interval to be integrated into the decision 
variable. However, the energy detector of the present study has an integra- 
tion time equal to the duration of the presentation interval. Thus, the 
decrease in the amount of common noise for the observers and the energy 
detector could cause the decrease in concordance at shorter durations. 

The leaky integrator model, of course, suffers the same difficulties 
as the linear-uncertain model in explaining Ahumada's finding and result 
that correlations with the energy detector on trials are greater than on 
Hq trials. Further, Jeffress’ model apparently cannot account for the 
decrease in inter-observer correlations with the introduction of CW at 
constant durations. 

The leaky -integrator model and the filter-bank model can be combined. 

The leaky integrator computes a short-term power-like quantity for the output 
of a single narrow-band filter. Assuming that the bandwidth is somewhat 
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smaller than the equivalent square band-width of 90 Hz estimated by Jeffress, 
a bank of such filters could be postulated. The filter-bank in this case 
is computing a portion of the short-term power spectrum of the input. It 
is reasonable to extend Ahumada's decision variable to be the maximum 
output in time as well as frequency. I shall call this model the leaky 
filter-bank model . 

The leaky filter-bank model may be viewed as appending a particular 
decision rule to a processor which continuously computes the short-term, 
frequency- limited power spectrum of the input. Such a processor could 
also be implimented by taking the Fourier transform of the short-term auto- 
correlation function of the input. Thus, the processor of the leaky filter- 
bank model is quite similar to a suggestion by Licklider (1951) that the ear 
performs a short-term autocorrelation of the input in monaural listening tasks. 

In summary, the noisy linear- uncertain model is unable to account for 
most of the results from direct comparisons between observers. A leaky form 
of Ahumada's filter bank model is able to give a qualitative account of most 
of the data from direct comparisons, but has serious difficulty with the 
finding that the concordance between observers decreases on signal trials 
when CW is added to background noise. 

5, 5 Summary 

Predictions of the level of correlation between observers derived 
from the linear-uncertain model and the filter -bank model are compared for 
several experimental conditions. In the experiment the decisions of human 
observers are compared with the outputs of two electronic devices . The 
first device is an analog multiplier which computes the cross-correlation 
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between the signal and the noise waveform sample on each trial. The 
second device is an energy detector which computes the energy of the 
noise waveform sample' during the presentation interval in a narrow frequency 
band centered at the signal frequency. 

The linear-uncertain model predicts that the correlation between the 
human observers and the cross -correlation is not zero and should increase 
when a continuous sinusoid is added to the background noise. Neither of 
these predictions is verified. Since the energy detector receives only 
noise at its input in this experiment, the model also predicts that the 
correlation between the observers and the energy detector should be less 
on trials when signal is present than when it is not present. The results 
show the correlations to be weakly significant in the opposite direction. 

t 

It is concluded that the linear -uncertain model and its special cases , the 
linear, energy, and envelop models, represent an inadequate approximation 
to the actual form of human monaural auditory processing in detection tasks. 

Predictions from the filter-bank model agree with the above results, 
but cannot account for an observed decrease in inter-observer correlations 
when the signal presentation interval is shorted. A modification of the 
filter-bank model is suggested to account for this discrepancy. 

A final result remains unexplained by any of the models considered. 

A decrease in inter- observer correlations is found when a continuous sinu- 
soid is added to the background noise for either of two signal durations. 

It is emphasized that this unexpected finding implies that there is a serious 
deficiency in current models of monaural auditory processing in detection tasks. 



APPENDIX I 


A NONPARAMETRIC ESTIMATE OF THE AREA UNDER THE ROC CURVE 
A1 . 1 Theory 

The notation and definitions of Sections 4.3 and 4.4 will be used 

here . 


With sample decision values or response categories ties can occur. 

The marginal probabilities of a tie are given by 

P(T a ) = PCa.j = 0) 

P(T g ) = P(b i;j = 0). 

Then it is easily shown that 

VarCa i;j ) = 1 - P(T q ) 

Var(b ) = 1 - P(T ), 
ij p 

whereas the expression for the covariance is unaffected by ties: 


CovCa^j ’ b ij) = PCS) - PCD], 


as in (4.9) . 

Therefore the estimate of tau (preferred by Kendall, 1962) is given by 

est i = est Corr(a. .,b. .) 

ij ij 

(Al.l) 


= PCS) - p (p) 

/[I - P(T a )][l - P(T g )f 

where the probabilities refer to observed relative frequencies. It has 
been found that tau estimated in this wav is often relatively unaffected by 
grouping of the variables. That is, if a pair of continuous variables are 
categorized into 5 or 10 equally probable values we may expect the three corresponding 
estimates of r to be quite close in numerical value. Of course, it is 
impossible to obtain consistent estimates of tau using grouped data unless 
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the underlying continuous bivariate distribution is known and the grouping 
procedure is under the control of the statistician. 

The estimate of gamma used by Goodman and Kruskal (1964) can be viewed 
in exactly the same way as Kendall's estimate of tau. Since 


Y = Corr(a. . ,b. . j a. . £ 0, b. . ^ 0) 
i 1 xj 1 .11 lj 


the estimate is given by 

est y = est Corr(a. . ,b. . I a. . ^ 0, b. . f 0) 
il ij' xj il 


P(S) - P(D) • 
1 - P (T) 


where P(T) = P(T q U t ) . 

In connection with Theorem 4.1 these estimates suggest an appropriate 
estimator for the area under the ROC curve which attempts to correct 
for the grouping of an observer's decision variable. The theorem proved 
that in case the observer's decision variable is continuous 

P A * (y + D/2. 

Since is continuous a.^. cannot equal 0. Thus, in this special case 

Y - Corr(a. . ,b. . |b. . i 0) . 
r lj ij 1 lj 

According to the preceding estimates we should take 

- est P = -^-[est Corr(a. . ,b. . ]b. . ^0) + l] . 

A 2 l lj’ xi 1 ii 

Now 


and 


Cov(a. . ,b. . 
H 11 



i 0) = 


PCS) - P(D) 
1 - P(T p ) 


Var(b^.|b.^ ^ 0) =1, 
Var(a..|b. . + 0) - 1 - PCTjb^ / 0) 


9 


Therefore 
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est P A - 


Hr 


P(S). - P(D) 


p(T a i V [1 ' p(T e }1 




(A1.2) 


We can obtain a somewhat simpler expression to work with using the following 


identities. 


But 


p (TjT 6 ) 


P <VV 

1 - PCTg) 
pcy - p(T a ,T e ) 
1 - P(T g ) 


so 


and 


P(T T ) = P (T ) + P(TJ - P(T) 
ct p a p 


P (T) - P(TJ 

P(T JV ■ 


i - pit It ) = - 1 -~- p , 

V W 1 - P (T ) 

p 


Thus (A1.2) becomes 


est P, 


■*[ 


P(S) -P(D) 


{[1 - P (T) ] [ 1 - P (T g ) ] } 


1/2 


+ 1 


(A1.3) 


A1.2 Computation 

In this section I develop a common notation for the computational 
formulas*of est P^, est t and the trapezoidal estimate of the area. 

Let the response categores for two observed decision variables R and 
R' be Rp R 2 , . R^ and Rj, Rj,, R^, respectively, where R^ and RJ 

indicate the greatest confidence in and R^ and indicate the greatest 
confidence for the alternative hypothesis Hp. The observed joint frequency 
table is then in the following format: 


* Kendall (1962) also gives a similar computational scheme for est t. 
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R i 


R 2 


R n 


R, 


R, 


R 


m 


hi 

f l2 

• e • 

f m 

f 21 

f 22 

• • • 

f 2n 

■ 

Bj 

• « 0 


hi 

f m2 

to* 

f 

mn 


m' 


•1 *2 *n 

To compute P(S) = P(a^b^ > 0) we may take each joint observation and 
compare it with any other observation in the table. The comparisons which 
contribute to P(S) from an observation in a cell (i,j) are all those obser- 
vations in cells below and to the right of (i, j) . The total number of 
such observations is 


n 


I I 

h=i+l £=j+l 


c kr 


Now the comparison is made with each, of the f„ observations in cell (i,jj. 
Therefore the total number of comparisons in the numerator of P(S) is 

m-1 n-1 m m 

s ' l l fii I i f k £ C 41 - 4 ) 

i=l j=l k=l 1=1 K 

The denominator is simply the total number of comparisons 

N = f (f - l)/2. (A1.5) 

Thus 

P(S) - S/N. ' (A1.6) 

The probability of disagreement in signs is computed similarly as 

P(D) = D/N, (A1.7) 


where 
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D = X X .1 X f k v 


ra n i-1 j-1 

y y 

u U “i L L 

i=2 j=2 13 k-1 1 = 1 

Now the number of ties in the decision variable R is simply 

m 


so that 


Similarly 


T a s 

1 = 1 


P(T ) = T /N. 
v or a 


n 


• l f.iK.-i - D/2 
i=l ‘ ] 


and 


(A1 . 8) 


P( V = T 8 /N - 

Thus from (Al.l), after slight rearrangement of the N's, we have 

S - D 

est x = 

AN - T ) (N - T ) 
a p 


(A1.9) 


(A1 . 10] 


(Al.ll) 


(A1.12) 


(A1 . 13) 


as the computational formula for est x. 

To estimate the area the ROC curve we use the data format given below 


R 1 R 2 


R 


m 


H, 




f ll 

f 12 

• • • 

f i„ 

f 01 

f 02 

• f e 

f 0m 


f, f 0 

■>1 • *2 


f f 

•m 


Now S and D simplify to 


m-1 m 

s = h fu jL / 01 

m-l m 

0 - l f 01 l f lj 
1=1 U1 j=i+l 1 


(A1.14) 


(A1 . 15) 
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The number of ties T in either variable is 

T = N - (S + D) . 


Now, 


Thus 


(A1 . 16) 


T e ■ 

’ t £ l. * - Cfj. * f 0 .)]/2 , 

- ( f f. * £ o- ' f - )/2 


N - T, 


1 - P(TJ = 


N 

£ (f - 1) - (f? + f? 

• o v ^ 1* 0* 


- £ ) 

• • 


2N 

f 2 - £? - f n 

• • 1 • 0 * 

‘ 2N 


The area estimate becomes, upon substitution and rearrangement. 


est P A “ 1 S ' P * \ ■ (A1.17) 

sm * wcfj. - ff; - £ 0 .) 

This estimate of the area is a distinct improvement over the one I 
proposed earlier (Wilcox, 1967) which was based on using the estimate of 
gamma given in Section Al.l. 

For comparative purposes the computational formula for the area using 
the trapezoidal rule is given below 


trap P A 


m 

S *T ! f 


i=l 


li f 0i 


f l- f 0- 


(A1.18) 


For two response alternatives this reduces to the probability of a correct 
response 

P(C) ='P(H)P(H 1 ) + [1 - P(F)]P(H 0 ) 

if and only if P(H^) = P(Hg) = 1/2. This P(C) , of course, is not to be 
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confused with the probability of a correct response in a two interval -forced 
choice procedure which, under certain assumptions, is P^. 

.For less than 5 response alternatives (A1.18) provides a rather gross 
underestimate of the area under the ROC curve. With as many as 10 response 
alternatives that are approximately equally spaced in probability trap 
is almost indistinguishable from est P^« 



APPENDIX II 


STATISTICS OF LINEAR UNCERTAIN DECISION VARIABLES 
The marginal and joint moments for linear -uncertain observers reported 
in Chapter IV are derived here. 

A2 1 ’Preliminaries 

The decision variable of the linear-uncertain observer a is, from 
Equation 3.27, 

Z ~ ~[A y'D y + X u’D (2z/ - y )]. 
a N q l or aP a a a v a cr J 

To compute the moments of z , it is more convenient to express z^ in summa- 
tion notation. Since D a is a projection operator, there exists an orthogonal 
transformation of F in which the equivalent projection operator to D q is a 
diagonal matrix with entries on the diagonal either 0 or 1 . Since the trace 
of D is the dimensionality of the subspace F^, the number of l's in the 
equivalent matrix is m a = 2W q T - 2 6 q . We shall let = min{m a ,nig} under 

the assumption that there exists a subspace F _ in F for which F aB = V" F 3 

and F „ - F or F „ = F„. Thus, the decision variable z (with no inter- 
aB a aB $ ct 

nal noise) in' the equivalent representation (i.e., the frequency representation) 


becomes 


z = rr— T FA y? + A p . (2y. - y .)] 
ct N„ .t L a-’i or ai ^ 7 i ai' J 
0 i=l 

Several further preliminaries are necessary to ease the burden of the 


derivations . 


Definition. The joint central moment of type (i,j,...,k) for a sequence 
of random variables (x,y,...,z) is the expectation 

U,- , (x,y,...,z) = F{[* - FCx)] 1 ^ - ^Cy)] j ... [z - 

Thus, MjCx 2 ) is not the mean of x 2 ; y 2 Cy) = y i;i (y,y) is the variance of y; 
and lijjCXjy 2 ) is the covariance between x and z = y z . 


93 - 
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(The mean vector y^ of an observer’s prior distribution will always 
contain a Greek subscript and so should cause no confusion with the nota- 
tion for joint moments.) 

Let a s b, and c be independent random variables. We observe that if 
w = ax + by + c where a and b are independent of x, y, and z, then 

y ll^ W,5 0 = P u (ax + by + c, z) 

= ^(a)y 11 (x,z) + ff(b)y 11 (y,z) + y u (c,z). 

Further if c is a constant, or independent of z, then y^(c,z) = 0. Of 
course, these relations may be generalized to joint moments of type (1,1, . ,1) . 
Finally, we recall that 

y u Cx,y) = S’(xy) - ff(x)£(y) . 

Proposition. Let y have a normal distribution with mean zero and variance 
o 2 s ^2^' T ^ en » odd moments of y are identically zero, and the even 
moments of y are given by 

h (2r) Cy) = SCy 2r ) = a 2r (2r)!/2 r r! 
for r a positive integer. 

A proof may be found in Kendall and Stuart (1958, p. 60). 

Corollary. Let y^ be a sample of the modified noise vector, i.e., 

y i = n i " V • Then > 

£(y? r ) = N^(2r)!/2 2r r!. 

The corollary follows from the fact, found in Chapter II, that 

P 2 (y i ) = n q /2. 

Three independent normally distributed internal noise sources are con- 
sidered. is added to the input y. . e 2 is added to the memory-specified 
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signal vector y . e_ is added to the decision variable itself. The variances, 
a 3 

with convenient scale factors are 

• f 2 frii> " N oV 2 ’ 

p 2 ( - e 2i' ! = W 2 ’ 

w 2^ e 3^ = V 3‘ 

The mean of each error source is zero. 

The decision variable with e ^ and e ^ is 


z~ = r; — [A (y + e ,) '2? ( w + e ,) 
« N q l a^a al' a^a al 


CA2.1] 


* X a Cp « * %2>' D J 2 ^ a * V 5 ’ (l V * e a2 )] ’ 

where the input y has been given a subscript to allow for the possibility that 
two observers may have different inputs. The noisy decision variable with 
criterion variability is 


a 


Z * ^ 

a a3 


(A2.2) 


The decision variable for a second observer g is obtained from z* by replacing 
each a with the subscript 8. 

A2.2 Derivation of the Mean 

The mean z* may be found directly. Since Z?(e 3 ) = 0, 

E ( e a l) ~ s and e a.l’ e a2 * an< * are rautua liy independent, we have 

E( 2 .) - g lz j 

■ l V E KW * 

+ X [2y'Z> E(y ) - v'D y - E(_e'D e 0 )]} * 
a a a w a a a a a2 a a2 J 

- ~ {X [E(y'D y ) + 6 N.V , 1 
Nq a 1 y a a^a a 0 al J 

X [2y'P E(y ) - E - 6 N.V _] } . 

. a L a a v;7 cr a a 0 a2 J 

When a signal s ^ is present y^ = y + s . Then, defining, as in 

Chapter IV, 


y = s 
s a 
a 
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R aB V a°a3V r aB " 2R aB /N 0 5 

E = R , d = 2E /N. 
a aa a a 0 


we have 


£( z*) = — [X (6 N n + E +6 N V .) + A (2R - E + 6 N n V „)] , 

v or Nq L a a 0 .. a 0 al J a k a a a 0 a2•' J, 


a 


and collecting terms, 

E(z*) = X [6 (1 + V .) + d o /2] + T [r „ - d /2 - 6 V _] 

v n J (]J L Q v Ctl^ fl J ft ^ raC ft* rsi fti V J 


a 1 as 


a a 

A2.S Derivation of the Variance and Covariance 

The covariance between two observers is 


a a2 J 


(A2.3) 


v llK’ Z V = v ll (Z a + e a3’ Z 3 + W 


= V 11 ( V V* 


CA2.4) 


Let ii* = y + e . and p* = p + e and similarly for observer B. 

^ n rv /vl /v rv n s 


. >1 w-iiu K “ H ’ o A « 

'a '■'a al a a a2 

Then the covariance becomes 

u n<vv - ^ vvw: * \ k ,d ^k - v p - 
0 vrvi + v* B ,B 6 f2 n - “P 1 - 

This may be expanded into four terms: 

N?p 1t Cz ,z„) = A A u [y*'D y*>yVD 0 yt] 

0 ll v a’ B a B 11 y a a y a y B 3 B 

• * - ' , « ) -»b'Vs 1 

♦ i ,Vni»: , W' t 5 , v 2, « - 

* ' V V-W 2 n - I 51 

' ’ Vb T 1 * Vs T 2 + l r/[J T 3 * V e 4- 


(A2 . 5) 


Each of the four terms will be evaluated for two special -cases, namely 
where the observers are different (a ^ B) and where they are the same (a = B) • 
The latter, of course, is the variance. Further, we let u = y + s , and 
y & - y + Sg, and so consider the possibility that the observers may be 
presented different signals. Notice that here y denotes the modified noise 
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vector n - u n . The first term, T , will be evaluated in some detail; for 
the • other terms the discussion will be brief. 

Evaluation of Term 1 . 

We first rewrite the moment' in summation notation for the appropriate 
equivalent representation space: 

T 1 ■ “ntCVM'W 

m m fl 


m a mg 

- .1. 

1=1 J=1 J 


Now, when i f j the random variables in the brackets are independent regard- 
less of the case being considered, so that such y^ terms are zero. Moreover, 

by assumption, either F is a subspace of F or the reverse. Thus, in the 

a p 

equivalent representation space the summation extends only to m^ = minfm^jm^). 
These comments imply that may be simplified to 


m a 

T i - X -ntO'JpMyJi) 2 ] 


i=l 

m o 

aB 

' * l . T n- 
1=1 

where the definition of is implicit. 

For a f $, substitution for y*^ and y*^ yields 

T li = "u^i "■ e ali) 2 -f’'8i + W’ 1 

* >-lltC)d< * + e ali <y 8i * + e2 ” ] 


ll lv7 ai 


Bi Bli Bli 


However, if a = B, we have 

T-i * = Un Cy 2 . ,y 2 .) + 4y .. (y .e y .e + y, , (e 2 , . ,e 2 , . ) , 
li 11 y ai ai 11 ai all ai all 11 all all ■ 

where the obviously zero terms have been dropped. For the second term on the 
right of the latter expression we find 
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' - t^ci^ai” 2 

■ E < y ai> Val /2 ’ 


since = 0. For the last term, 


u n (e ali’ e aH ) = >‘ 2 (e ali ) 

= S(a'ii) - [E(e^ u )] 

/ N V . \2 , N ft V \ 2 

•H-^) - (^) 

- K 0 V al /2 ’ 

where the proposition was used to obtain 

Now collecting terms again, for a = 8, 

T|. = yil (y 2 .,y J + 2£ , (y 2 . )N n V . + N 2 V 2 /2. 

li 11 / ai ,/ ai y ai 0 al 0 al 

Covariance. Since y . = y . + s . and y a . = y. + s fi . we have 

1 3 'cci J i cti px i pi 

T li = 

* "nt^i + 5 «i )2 ' ty i + s 8i’ 2j 

■ ‘ , llt y i" y i] * 4s ci=Bi l, ll fy i ,y i 1 

- 2 CV 2)2 - 4 s ai s Bi t V 2 > 2 

* N 0 /2 * 2N 0 S ai S Bi- 


Therefore, 


m a8 

T i - 1 T n 


i=l 
a a6"0 


m 


a6 


= m N^/2 + 2N J s . s . • 
«« n 0 ai pi 

i=l 


Variance. 


= 8 0 N2 + 2N.R 
a8 0 Os s 
a p 


t, . = u [ (y - + s O 2 , (y- ■*■ s . ) 2 1 

li ll LV i ai w i ai 

; .) 2 jN n V , 
ai 0 al 


* 2E l(y, * s.h) 2 )^, * NgV^/2 
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Therefore, 


= K n Cy?,y?) - ^PjjCyj.yp 

,[E(y|) + s^]N 0 V al * N^/2 

- "o ' 1 * V al * 2 V /2 + 'Vai ' 1 + V 

T i " s « N o a * v «i> 2 + 2N o E s » + V- 


Evaluation of Term 2 


m a8 

T 2 = Jj i 


m 


a3 




i=l 


U lu ai' 


m 


a 3 


^ f 2T 21i " T 22i^ 


i=l 


T 21i = V U^ai + e «2i )( y«d * e ali^ (y Si + W' 1 

= p ai y ll < - y ai ,y 3i^ ) + 2y ai P ll ^ali jy Bi e Bli^ 

T 22i = p ll^ p ai + 6 a2i )2 ’ (y 3i + W' 3 

= 0. 


Tgj Covariance . 


T 21i * Vll^i * ^i-^i + s 8i ]21 

- M ai [2s Bi l, ll ( V> r i )1 
‘ Vci s ei- 


Therefore , 


m 


T ? ‘ 2N n I 


aB 


0 . L 1 y oi S 3i 

i=l 


= 2N n R 
0 as. 
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Tg, Variance 


T 21i = y ai y ll [y i + S ai j(y i + S ai )2] 

+ 2p ai y ll^ e ali ,(y i + S ai )e ali^ 
N 0 y ai S cd + 2y ai S ai y ll ^ e ai ,e ai^ 


= N n y .s . Cl t V .) 
0 ai ai^ al' 


T 0 = 2N n R (1 + V J 
2 0 as 1 al J 
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Evaluation of Term 3. 

Because of symmetry is with £ replacing a.- 
T Covariance , 


T s , Variance . 


Evaluation of Term 4 


T 3 " 2N 0 R Bs 


T 3 * 2N 0 R Bs V * V 

P 


t 4 - - k'\k- 2 *v b ? v ■ tvs* 

“aB 

" Jj - 2 ‘ , u t,, Si y Ji' Cl 'ei )21 

- 2,, ll [( > , Si :)2 "> , 6i y 8i 1 + "llf ( ’ , k’ 2 - (p 8i )21 


“ .1 l 4y 4l - 2 t,2 - 2T 43 4 V • 
1=1 


T 41 + e a2i^«i + + + W 

= U ai y ei { y ll [ y ai ^ + ^ ll ^ ali ’ W 1 

+ y ll^ y ai e a2i ,y Bi e g2i^ + y ll ^ e a 2i e ali ,e B2i e Bli^ 
T 42 = P ll^ U ai + e a2i' ,fy ai + e a li 5 5 (y pi + e £2i^ 

= v ll (y ai e a2i ,2v 3i e e2p* 

By symmetry to J.- 
» 

T 43 =y ll (2y ai e a2i’ y Bi e 62i :) 

T 44 = y ll^ u a i + e a2i 52,Cu Bi + e B2i 3 . 2 ^ 

^ 4w ll [u ai^2i' y BiVi ] + y ll [e a2i> e B2^ 

T ^ Covariance . 

T 41 = y ai ll Bi y ll^ y i + s ai ,y i + s Bi^ 

" VaiV 2 ' 

T 42 " T 43 = T 44 = ° 



- 102 ' - 


Therefore, 


T„ = 2N R Q 
4 0 ag 


Variance. 

= n2 


T.. = y 2 . {p. - [y . + s . ,y. + s .1 + y,- [e . . ,e 

41 ai ll l/ i ai ; x ax J 11 1 all’ .alx J 

+ M 11 [Cy i + S ai )e a2i’ ( >i + s cd 5e a2i ] 

* u ld e a2i e ali' e a2i e iild 

" N 0 U «i a + V al )/2 * hl^i'W’YW * s ai y ld e a2i’ e a2d 
+ S(e a2i’ e ali> 

* * V ad + S ai V „2] /2 * N 0 V «2 C1 + V „d /4 ' 

T.., =' 2y. . [ Cy. + s .)e ,y .e 

42 ll LW i ax' a2x* ax a2x J 

= 2y .s .y n , [e ,e -.1 
ax ax 11 1 a2x a2i J 

= N n y .s .V . 

0 ai ai a2 

T.„ = N_y .s .V , 

43 0 ai ai a2 


44 


4y 2 .y 11 [e „.,e _.] + y^fe „.,e 
ax 11 L a2x a2x J 11 L a2x a2i J 

2N_y 2 .V _ + N 2 V 2 /2 
0 ax a2 0 a2 / 


Therefore, 


T. = 2N n [E (1 + V J + E V 0 ] + 26 N 2 (l + V ,)V , 
4 . 0 L a^ al J s a2 J a 0^ al J a2 

a 

- 2N-R V , <■ 2N.E V _ + 5 N 2 V 2 
0 as a2 0 a a2 a 0 a2 
a 


Evaluation of ' y_,f z*. z*) 

J 11 a p 

Having completed the derivation of the individual terms of y (z*,z*) 

11 a p 

in the previous sections, we may collect the terms as defined in Equation A2.5. 
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Covariance, y^ = y + s a> y = y + 


|I ll ( 2 a- 2 ? ' W'aB + r s J * X aVas„ 

a_ p_ ■ p 

+ X X r 0 + X X r - 
a g gs^ a g ag 

Variance, y - v + s 
ff a - .a 


iu(z*) = X 2 [6 (1 + V J 2 + d (1 + V -)] + 2X X r (1 + V .) 
2 v a' a L a' - al y s ^ al- ,J a a as v al J 

a a 

+ P{5 V _[2(1 + V .) + V „] + d (1 + V , + V J 
a a a2 L ^ al a2 J a k al a2 


+ (d - r )V 0 } * V _ 
s as a2 a3 
a a 


(A2.6) 


CA2.7) 



APPENDIX III 


GLOSSARY OF SYMBOLS AND TERMS 


a. 

1 


a. . ,b. . 
iJ* ij 


A 


B 

c A (t) 
Corr( , ) 
Cov( , ) 
C 


the coefficient of cos in the 

finite Fourier series .approximation to 
x(t). 


signum of the difference z . 
Z Bi “ Z $y respectively. , al 


z . and 
aj 


the matrix of a filter A. 


the coefficient of sin — — in the 
finite Fourier series approximation to 
x(t). 


a matrix such that A = B'B. 

short hand for J-jif cos ' 

population correlation of the arguments. 

population covariance of the arguments. 

the matrix of an orthogonal linear trans- 
formation. 


d standardized energy: 2 E/Nq. 

standardized energy of the signal vector s. 



d 


a 


d 1 


standardized energy of the signal vector s 
presented to observer a. If s is the nulf 
signal 0, d =0. 

a 

standardized energy of the memory- specified 
reference signal 

the square-root of the standardized energy d 


d* 

s 


/r 

■ s 


d z 


sensitivity index of the decision variable z 



d» 


z 


for observer a. 
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det . 

determinant . 

D 

a 

a projection operator matrix for observer a. 

D *e 

the product D D a . 

a 3 

Double-probability paper 

graph paper linear in both coordinates with 
standard normal deviation scores . 

e 3 

an error term added to the decision variable 
z. 

e 

a random error vector. 

e al 

- a random error vector added to the modified 
input y for observer a. 

e a2 

a random error vector added to the memory- 
specified reference signal y^. 

E 

the energy of a constant signal waveform 
of duration T. 

E 

s 

the energy of s. 

E 

a 

the energy of y^. 

E 

z 

a 

the signal energy necessary for the ideal 

observer to performance at the level as 

observer a who uses the decision variable z . 

a 

E(. ) 

the expectation operator. 

f( ) 

a density function of the argument . 

F 0 ( Z ) 

t 

the cumulative distribution function of z 
conditional on H Q . 

F 2 (2) 

the cumulative distribution function of z 
conditional on . 

F 

the frequency representation space. 

F 

a 

a subspace of F generated by the projection 
operator D . 

False-alarm rate 

same as P(F) . 

g (s) 

an a priori density function of the signal s. 
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G(s) 


the cumulative prior distribution of s. 


h 1 (x),h 0 (x) the distribution densities of the input 

known to an observer conditional upon 
and H Q , respectively. 

the experimenter-specified hypotheses of 
signal-plus-noise and noise-alone, respec- 
tively. 

Hit-rate same as P(H). 


I 


an identity matrix. 


Iso-bias curve jargon for the curve which cuts across 

a family of ROC curves at points of 
constant slope. 

In ( ) the natural logarithm of the argument. 

t(x) the likelihood ratio of x. 


V 2 


n. 

1 

n 


n 


the imprecision (variance) of an observer's 
specification of a single component of the 
signal in the frequency representation space. 

the j — entry in a noise vector sample n. 

a sample vector of the noise random vector 

n. 

a random noise vector. 


n, 


a sample noise vector explicitly repre- 
sented in F. 


n(t,n) 


a stochastic noise process specified by 
the random vector n as a function of time. 


N . 




the total average power of a bandlimited 
noise process. 

the noise power per unit bandwidth N/W. 

a multivariate normal density function with 
mean vector y and dispersion (variance - 
covariance) matrix %. 
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Observer 

P A 

PC 9 ) 

PCC) 

P(D) 

P(F) 

P(H) 

PCS) 

P(T) 

PCT a ),PCT e ) 
Psychometric function 


a subject or device in a detection task; 
when the stimulus is acoustic subjects 
are also called "listeners" and devices 
"receivers". 

the area under an ROC curve. 

the probability of an event. 

the probability of a correct decision. 

The meaning of "correct" depends upon the 
context . 

the probability that jointly observed 

differences have Different signs. D is 

the event a. .b. . < 0. 

13 13 

the probability of a "false- alarm", i.e., 

PCs ay H. |H_ true). P(F) is also referred 
to as the "false- alarm rate" or the "incorrect 
detection rate". 


the probability of a "hit", i.e., 

P(say H |H. true). P(H) is also referred 
to as the "hit rate" or the "probability of 
(correct) detection". 

the probability that jointly observed differences 

have the same sign. S is the event 

a- .b. . > 0. 
lj 13 

the probability that either one or both of 
jointly observed differences are zero. T 

is the event a. .b. . = 0. 

13 13 

the probability of a tie in two independent 
samples of a random variable. T is the 
event a.. = 0. T 0 is the event b. . = 0. 

’a graph of the relationship between a per- 
formance index and a physical measure of de- 
tectability. 


the precision matrix (? n + £ ) 

the precision matrix • 

the standardized inner product (or cross- 
correlation) of the vectors y and s, 
i.e. , 2\i'D s/N . u 
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r 


ct8 



R 1 ,R 0 



ROC curve 


s i {t) 

s(t) 


s 


S JL 


T 


T 




V V 

V v a2 


the standardized inner product of the 
vectors P q and p i.e., 2 U a 0 a gUg/ N o- 

the standardized inner product of the V 

vectors p and s i.e., 2p’I> s„. When 

a 8’ 1 a a 8 

= 0, r =0. When s D = s, r 

K J we k ' me 


r . 


as 


a response vector. 


acceptance regions in the representation 
space for the hypotheses and H^, respec- 
tively. is also called x the criterion 
region. 


the inner-product 

“iVe- 


(cross- correlation) 


the theoretical curve (the receiver operating 
characteristic) [?(F),P(H)] generated by a 
decision variable z for discrimination 

and H, 


between the hypotheses H 
short hand for sin 


1 

2tfit 


0* 


a constant signal waveform defined for 
0 < t < T. 


the Hibert transform of s(t). 

a signal vector representing s(t)._ 

the representation vector of s_j_(t) . 

the signal vector presented to observer a. 

Usually on H, trials s = s and on FL 

trials s = 0. 
a 

the time interval over which x(t) is repre- 
sented, Also, the duration of the presen- 
tation interval. 


the temporal representation space. 

the variance, relative to N /2, of a 
component of or e , if the observer 
is specified. 

the variance, relative to N^/2, of a component 
of 0 2 or e a2 . 



V V 
3’ a3 

Var( ) 
W 



x. 

1 

X(t) 

x(t) 

x(j/2 w) 

SB 


a? f 

ll*|| 

* 

1/ 

2/* 

z 

z 

c 


the variance of e_ or. e 

3 cto 

the population variance of the argument.. 

the bandwidth of the noise process . 

the equivalent square bandwidth of the 
(hypothetical) interval filter of observer 

a. 


•the minimum of W a and i.e. , the overlap 
in bandwidth of the interval filters for 
observers a and g. 

. . the i— entry in the vector x. 

a waveform; the input waveform- sample 
presented to an observer. 

the finite Fourier series approximation to 

x(t); 

the value of x(t) at t = j/2 W. 

a representation vector for x(t). 

the representation vector of x(t) explicitly 
with coordinates in the frequency represen- 
tation space F. 

the transpose of x. 

the length of x, i.e., '/x'x-. 

the sample space of vectors x. X may be 
interpreted as either T or F. 

the modified input vector x - jj . 

V + e y 

a decision variable. 

a particular value of z. 

the "perfect" decision variable. 


z 


the decision variable of observer a. 
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z 


aO 


z 


al 


z*‘ 

a 


z 

1 


a 

B 


Y 

6. 


ag 


a 

"'"a 

^ct) 


p,p 


ag 


P 1 ,P 0 


a 


2 


X 


m 


X 


n 



a value of z conditional on !I 0 . 

a 0 

a value of z conditional on H, . 

a 1 

the decision variable of observer a aug- 
mented by including various interval 
sources of error. 

the decision variable of the ideal 
observer. 

a label for an observer. 

(1) a label for an observer; 

(2) a cut-off value of a decision variable 
z which determines an acceptance region 



the Goodman- Kruskal coefficient gamma. 

short-hand for the bandwidth- time product 

W T. 
a 

short-hand for W 0 T. 

ctg 

the relative uncertainty parameter 
= M 0 /(M 0 *N 0 ). 

short hand for 1 - X . 

a 

the efficiency of an observer. 

the j— temporal interpolation function. 

the linear correlation between the decision 

variables z and z 0 . 

a g 

the correlation conditional upon or H^, 
respectively. x 

variance . 

the dispersion matrix % + J, . 

ns 

• the dispersion matrix of the noise process, 
the inverse matrix of % . 

the dispersion matrix of an observer's prior 
specification of the signal s. 



p * 


- Ill — 


T ’ T ae 


m 


n 



Kendall's tau. 

tau conditional upon or H^, respectively. 

the vector sum y + y . 

n a 

the mean vector o£ the noise process. 

same as s^ (for consistency- in notation) . 

the mean vector of the observer's prior 
specification of the signal. 

the vector sum y + e 

a a2 

a standardized moment of type (i,j). 
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